G PROTEIN-COUPLED GLUTAMATE RECEPTORS



Background of the Invention

The majority of nerve cell connections are 
chemical synapses. A neurotransmitter is released from 
the presynaptic terminal, typically in response to the 
arrival of an action potential in the neuron, and 
diffuses across the synaptic space to bind to membrane 
receptor proteins of the postsynaptic terminal. The 
binding of neurotransmitters to membrane receptors is 
coupled either to the generation of a permeability change 
in the postsynaptic cell or to metabolic changes. 

Neurotransmitters produce different effects 
according to the type of receptor to which they bind. In
general, those which produce effects that are rapid in
onset and brief in duration bind to receptors that act as 
ligand-gated ion channels, where binding almost instantly 
causes an ion flow across the membrane of the 
postsynaptic cell. Those neurotransmitters which act 
more like local chemical mediators bind to receptors that 
are coupled to intracellular enzymes, thereby producing 
effects that are slower in onset and more prolonged. 
These neurotransmitters alter the concentration of 
intracellular second messengers in the postsynaptic cell. 

Four second messenger systems have been linked
to neurotransmitter or hormone receptors and have been
studied for their roles in the control of neuronal
excitability. They are the adenylate cyclase/cyclic AMP-
dependent protein kinase system, guanylate cyclase and
cGMP-dependent protein kinase, the inositol
trisphosphate/diacylglycerol-protein kinase C system,
and systems which are activated by calcium ions, such as
the calcium/calmodulin-dependent protein kinase system.
Thus, binding of a transmitter to a receptor may
activate, for example, adenylate cyclase, thereby
increasing the intracellular concentration of cAMP. The
cAMP activates protein kinases that phosphorylate
proteins in the cell which form ion channels, thereby
altering the cell's electrical behavior. As with the
ligand-gated ion channel transmitters, the effects can be
either excitatory or inhibitory, and may affect the cell
at many levels, including the pattern of gene expression.
It is also believed that these chemical synapses,
associated with second-messenger systems, may be involved
in long-term changes that comprise the cellular basis of
learning and memory.

The ligand-activated membrane receptors do not 
activate the second messenger systems directly, however, 
but via a membrane-bound protein, the GTP-binding protein
(G protein), which binds GTP on the cytoplasmic surface
of the cell membrane and thereby acts to couple adenylate 
cyclase to the membrane receptor. Neurotransmitter 
binding to the membrane receptor is believed to alter the 
conformation of the receptor protein to enable it to 
activate the G protein in the lipid bilayer, which then 
binds GTP at the cytoplasmic surface and produces a 
further change in the G protein to allow it to activate,
e.g., an adenylate cyclase molecule to synthesize cAMP.
When a ligand binds a receptor, an enzymatic cascade
results as each receptor activates several molecules of G 
protein, which in turn activate more molecules of 
adenylate cyclase which convert an even larger number of 
ATPs to cAMP molecules, producing a substantial 
amplification from the initial event. 
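
The amplification described above is multiplicative across the
stages of the cascade. The following is a minimal arithmetic
sketch, not a kinetic model. The function name cascade_output and
the per-stage gains (10 G proteins per receptor, 2 adenylate
cyclase molecules per G protein, 100 cAMP molecules per cyclase)
are hypothetical values chosen only for illustration; the text
gives no numerical gains.

```python
from math import prod


def cascade_output(activated_receptors: int, stage_gains: list[float]) -> float:
    """Return the number of final product molecules for a multi-stage cascade.

    Each entry of stage_gains is the (assumed) number of downstream
    molecules activated or produced per upstream molecule at that stage.
    """
    if activated_receptors < 0 or any(gain < 0 for gain in stage_gains):
        raise ValueError("counts and gains must be non-negative")
    return activated_receptors * prod(stage_gains)


# Hypothetical gains: receptor -> G protein -> adenylate cyclase -> cAMP
illustrative_gains = [10, 2, 100]
camp_per_ligand_event = cascade_output(1, illustrative_gains)
```

Under these assumed gains a single ligand-bound receptor would
yield 10 x 2 x 100 = 2000 cAMP molecules, which illustrates why
even modest per-stage gains produce a substantial overall
amplification.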

Glutamate, aspartate and their endogenous
derivatives are believed to be the predominant excitatory
neurotransmitters in the vertebrate central nervous
system (Krnjevic, Physiol. Rev. 54:418-540, 1974).
Recently, glutamate has been described as playing a
major, widespread role in the control of neuroendocrine
neurons, possibly controlling not only the neuroendocrine
system but other hypothalamic regions as well. Four
major subclasses of glutamate receptors have been
described but their characterization has until recently
been limited to pharmacological and electrophysiological
functional analyses. See generally, Hollmann et al.,
Nature 342:643-648 (1989) and Sommer et al., Science
249:1580-1585 (1990). Three of the receptors, the
quisqualate (QA/AMPA), kainate (KA), and N-methyl-D-
aspartate (NMDA) receptors, are believed to be directly
coupled to cation-specific ion channels and thus are
classified as ligand-gated ionotropic receptors. The
fourth glutamate receptor binds some of the agonists of
the ionotropic receptors (quisqualate and glutamate, but
not AMPA) but has no shared antagonists, and is coupled
to G protein. Thus, this receptor, referred to as the G
protein-coupled glutamate receptor, or GluGR, is
pharmacologically and functionally distinct from the
other major glutamate receptors. This receptor has also
been termed the metabotropic receptor.

Agonist binding to GluGR has been shown to
result in the activation of a number of second messenger
systems, depending on the system studied. One of the
best characterized is the quisqualate activation of
phospholipase C through a G protein-coupled interaction
that leads to the stimulation of inositol phospholipid
metabolism. This activity has been studied in systems
that measure the accumulation of radiolabeled inositol
monophosphate in response to stimulation by glutamate.
The systems typically use brain slices from regions such
as the hippocampus, striatum, cerebral cortex and
hypothalamus (Nicoletti et al., Proc. Natl. Acad. Sci.
USA 83:1931-1935 (1986), and Nicoletti et al., J.
Neurochem. 46:40-46 (1986)), neuronal cultures derived
from embryonic mouse and rat cerebellum, corpus striatum
and cerebral cortex (Nicoletti et al., J. Neurosci.
6:1905-1911 (1986), Sladeczek et al., Nature 317:717-719
(1985), Dumuis et al., Nature 347:182-184 (1990), and
Drejer et al., J. Neurosci. 7:2910-2916 (1987)) and rat
brain synaptosomes (Recasens et al., Eur. J. Pharmacol.
141:87-93 (1987), and Recasens et al., Neurochem. Int.
13:463-467 (1988)). A major disadvantage of each of
these model systems is the difficulty in analyzing the
pharmacological and functional activities of GluGR in an
environment where other glutamate receptors and G
protein-coupled receptors such as muscarinic and
serotonin receptors are also present.

The Xenopus oocyte system has been used to
identify GluGR as a member of the family of G protein-
coupled receptors. An endogenous inositol triphosphate
second messenger-mediated pathway in the oocyte allows
the detection of GluGR after injection of total rat brain
mRNA, in that the oocyte responds to ligand via the
oocyte G protein-coupled PLC-mediated activation of a
chloride channel that can be detected as a delayed,
oscillatory current by voltage-clamp recording (Houamed
et al., Nature 310:318-321 (1984), Gunderson et al.,
Proc. Royal Soc. B221:127-143 (1984), Dascal et al., Mol.
Brain Res. 1:301-309 (1986), Verdoorn et al., Science
238:1114-1116 (1987), Sugiyama et al., Nature 325:531-533
(1987), Hirono et al., Neurosci. Res. 6:106-114 (1988),
Verdoorn and Dingledine, Mol. Pharmacol. 34:298-307
(1988), and Sugiyama et al., Neuron 3:129-132 (1989)).
Injection of region-specific brain mRNA and of size
fractionated mRNA have suggested that GluGR may be a large
mRNA (6-7 kb) and that it is enriched in the cerebellum
(Fong et al., Synapse 2:657-665 (1988) and Horikoshi et
al., Neurosci. Lett. 105:340-343 (1989)).

There remains considerable need in the art for
isolated and purified GluGR, as well as systems capable of
expressing GluGR separate from other neurotransmitter
receptors. Further, it would be desirable to
specifically identify the presence of GluGR in
tissues, thereby avoiding the time-consuming, complex and
nonspecific functional electrophysiological and
pharmacological assays. It would also be desirable to
screen and develop new agonists and/or antagonists
specific for GluGR, but to date this has not been
practical. Quite surprisingly, the present invention
fulfills these and other related needs.

Summary of the Invention 
The present invention provides isolated and 
substantially pure preparations of mammalian G protein- 
coupled glutamate receptors and fragments thereof. In 
preferred embodiments the receptors are coupled to a G 
protein in vertebrate cells, bind glutamate and 
quisqualate and thereby activate phospholipase C, and are 
capable of stimulating inositol phospholipid metabolism. 
Having provided such receptors in isolated and purified 
form, the invention also provides antibodies to the
receptors, in the form of antisera and/or monoclonal 
antibodies. 

In another aspect the invention provides the 
ability to produce the mammalian G protein-coupled 
glutamate receptors and polypeptides or fragments thereof 
by recombinant means, preferably in cultured eukaryotic 
cells. The expressed receptors or fragments may or may 
not have the biological activity of corresponding native 
receptors, and may or may not be coupled to a G protein 
in the cell used for expression. Accordingly, isolated 
and purified polynucleotides are described which code for 
the receptors and fragments thereof, where the 
polynucleotides may be in the form of DNA, such as cDNA, 
or RNA. Based on these sequences probes may be used to
hybridize and identify these and related genes which
encode mammalian G protein-coupled glutamate receptors.
The probes may be full length cDNA or as small as from 14
to 25 nucleotides, more often though from about 40 to
about 50 or more nucleotides.

In related embodiments the invention concerns
DNA constructs which comprise a transcriptional promoter,
a DNA sequence which encodes the receptor or fragment,
and a transcriptional terminator, each operably linked
for expression of the receptor. For expression the
construct may also contain at least one signal sequence.
The constructs are preferably used to transform or
transfect eukaryotic cells, more preferably mammalian
cells which do not express endogenous G protein-coupled
glutamate receptors. When bound by an appropriate ligand
such as glutamate or quisqualate, the receptor may
activate phospholipase C in the host cell via coupling to
G protein. Further, for large scale production the
expressed receptor may also be isolated from the cells
by, for example, immunoaffinity purification.

Cells which express the G protein-coupled
glutamate receptors may also be used to identify
compounds which can alter the receptor-mediated
metabolism of a eukaryotic cell. Compounds may be
screened for binding to the receptor, and/or for
effecting a change in receptor-mediated metabolism in the
host cell. Agonists and/or antagonists of the G protein-
coupled glutamate receptors may also be screened in cell-
free systems using purified receptors or binding
fragments thereof for the effect on ligand-receptor
interaction, or using reconstituted systems such as
micelles which also provide the ability to assess
metabolic changes.

In yet other embodiments the invention relates
to methods for diagnosis, where the presence of a
mammalian G protein-coupled glutamate receptor in a
biological sample may be determined. For example, a
monospecific antibody which specifically binds a G
protein-coupled glutamate receptor is incubated with the
sample under conditions conducive to immune complex
formation, which complexes are then detected, typically
by means of a label such as an enzyme, fluorophore,
radionuclide, chemiluminescer, particle, or a second
labeled antibody. Thus, means are provided for
immunohistochemical staining of tissues, including brain
tissues, for the subject receptors.

Brief Description of the Figures

Figure 1 illustrates the construction of
plasmid pVEGT, where Fig. 1A shows the construction of
pVEG; Fig. 1B shows the construction of pVEG'; and Fig. 1C
shows pVEGT'. Symbols used are T7 pro, the T7 promoter;
T1 and T2, synthetic and native T7 terminators,
respectively; M13, M13 intergenic region; the parentheses
indicate a restriction site destroyed in vector
construction; and pA is the Aspergillus niger
polyadenylate sequence.

Figure 2 illustrates representative responses
from voltage-clamp assays of oocytes injected with RNA
from positive pools.

Figure 3 illustrates a partial restriction map
of clone 45-A.

Figure 4 illustrates the cloning of the
receptor cDNA present in clone 45-A into Zem228R.

Figure 5 illustrates the DNA sequence and
deduced amino acid sequence of clone 45-A (corresponding
to Sequence ID Nos. 1 and 2). Numbers below the line
refer to amino acid sequence, numbers above the line
refer to nucleotide number. Putative transmembrane
domains have been overlined, and putative N-linked
glycosylation sites are indicated by closed circles.

Figure 6 illustrates a representative dose
response curve for varying concentrations of L-glutamic
acid. Error bars, where larger than the symbols,
represent SEM.

Figure 7 illustrates the DNA sequence and
deduced amino acid sequence of a subtype 1b glutamate
receptor clone (Sequence ID Nos. 16 and 17). Numbers
below the line refer to amino acid sequence. Numbers
above the line refer to nucleotide sequence.

Figure 8 illustrates the DNA sequence and
deduced amino acid sequence of a subtype 2a glutamate
receptor clone (Sequence ID Nos. 18 and 19). Numbers
below the line refer to amino acid sequence. Numbers
above the line refer to nucleotide sequence.

Figure 9 illustrates the DNA sequence of a
partial subtype 2b glutamate receptor clone (Sequence ID
No. 20). Numbers refer to the nucleotide sequence.



Description of the Specific Embodiments

GluGR is a family of G protein-coupled membrane
receptors for the neurotransmitter glutamate. As
glutamate has been described as having a major role in
the control of neurons, particularly neuroendocrine
neurons, GluGR may play a critical role in effectuating
such control. Consequently, the development of agonists
and antagonists of the GluGR-ligand interaction and
GluGR-mediated metabolism is of great interest.

The present invention presents the means to
identify agonists and antagonists of the GluGR-ligand
interaction by providing isolated GluGR. The term
"GluGR" refers to any protein either derived from a
naturally occurring GluGR, or which shares significant
structural and functional characteristics peculiar to a
naturally occurring GluGR. Such a receptor may result
when regions of a naturally occurring receptor are
deleted or replaced in such a manner as to yield a
protein having a similar function. Homologous sequences,
allelic variations, and natural mutants; induced point,
deletion, and insertion mutants; alternatively expressed
variants; proteins encoded by DNA which hybridizes under
high or low stringency conditions to naturally occurring
GluGR-encoding nucleic acids;
proteins retrieved from naturally occurring materials;
and closely related proteins retrieved by antisera
directed against GluGR proteins are also included.

analog, or chimeric GluGR as generally described in U.S.
Pat. No. 4,859,609, incorporated by reference herein.
The molecule may be chemically synthesized or may occur
in nature. Ligands may be grouped into agonists and
antagonists. Agonists are those molecules whose binding
to a receptor induces the response pathway within a cell.
Antagonists are those molecules whose binding to a
receptor blocks the response pathway within a cell.

By "isolated" GluGR is meant a GluGR
which is in other than its native environment such as a
neuron, including, for example, substantially pure GluGR
as defined hereinbelow. More generally, isolated is
meant to include a GluGR as a heterologous component of a
cell or other system. For example, a GluGR may be
expressed by a cell transfected with a DNA construct
which encodes the GluGR, separated from the cell and added
to micelles which contain other selected receptors. In
another example described below, a GluGR is expressed by a
cell which has been co-transfected with a gene encoding
muscarinic receptor. Thus, in this context, the
environment of the isolated GluGR is not as it occurs in
its native state, particularly when it is present in a
system as an exogenous component.

The invention provides cloned GluGR coding
sequences which are capable of expressing GluGR proteins.
Complementary DNA encoding GluGR may be obtained by
constructing a cDNA library from mRNA from, for example,
brain tissue. The library may be screened by
transcribing the library and injecting the resulting mRNA
into oocytes and detecting, by functional assays, those
oocytes which express the GluGR. Alternatively, the
clones may be screened with a complementary labeled
oligonucleotide probe.

The present invention relates to successfully
isolating a cDNA encoding a GluGR. Functional cloning of
GluGR was accomplished by substantial modifications and
improvements to a number of cDNA cloning and molecular
biology techniques. Initially, an enriched source of
GluGR mRNA prepared by sucrose gradient centrifugation of
>4 kb length rat cerebellum poly(A)+ mRNA was used as
template for cDNA synthesis. Further, a cDNA cloning
vector that was employed included a poly(A) tail, thereby
increasing by 40-fold the translational efficiency of the
transcription product of the cDNA insert, and a polylinker
site to allow the directional cloning of the cDNA into
the vector between the promoter and the poly(A) tail.
Vector construction for directional cloning is described
in co-pending U.S. S.N. 07/320,191, incorporated herein by
reference. The cDNA cloning vector also was used with
two transcriptional terminators, in tandem, following the
poly(A) sequences, efficiently generating a unit length
transcript product without non-coding plasmid or viral
sequences, and without requiring a restriction
endonuclease to linearize the DNA template (a standard
practice that will often prevent functional cloning
strategies from working due to the presence of the
endonuclease site within the coding region of the cDNA).
The cDNA synthesis strategy maximized insert size and
recreation of the 5' ends of the cDNA, without
introduction of homopolymer tails. cDNA inserts were
size-selected to be greater than 4 kb in length before
insertion into the vector. A library of 10^6 cDNA inserts
in pools of 100,000 was replica plated to reduce the
number of amplification steps in the fractionation of
sequentially smaller pools. Moreover, m1 muscarinic cDNA
(another G protein-coupled receptor coupled to
phosphoinositol metabolism) template was included in
transcription reactions of the subfractionated pools so
that before injection the in vitro transcripts from each
pool could be assayed by Northern analysis to assess
relative quantity and quality of the mRNA, and by
voltage-clamp of oocytes as an internal positive control
for each oocyte not responding to quisqualate or
glutamate. The inclusion of a dilution of SEAP-VEGT' (a
secreted form of alkaline phosphatase) template in
transcriptions was also employed so that oocytes selected
for voltage-clamp analysis were those synthesizing higher
levels of the co-injected GluGR mRNA. And further, low
noise electrical recording techniques were used to
monitor the small signals initially generated from rare
transcripts.
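
The fractionation of a positive pool into sequentially smaller
pools can be pictured with a short arithmetic sketch. The library
size (10^6 inserts) and the first pool size (100,000) come from
the text above; the subdivision factor of ten per round, and the
function names initial_pool_count and pool_schedule, are
assumptions made only for illustration, since the text does not
state how finely each positive pool was subdivided.

```python
def initial_pool_count(library_size: int, pool_size: int) -> int:
    """Number of pools needed to hold the whole library (ceiling division)."""
    if library_size <= 0 or pool_size <= 0:
        raise ValueError("sizes must be positive")
    return -(-library_size // pool_size)


def pool_schedule(first_pool_size: int, fanout: int) -> list[int]:
    """Pool sizes from the first positive pool down to a single clone.

    fanout is the assumed number of sub-pools made from each positive pool.
    """
    if first_pool_size <= 0:
        raise ValueError("first_pool_size must be positive")
    if fanout < 2:
        raise ValueError("fanout must be at least 2")
    sizes = [first_pool_size]
    while sizes[-1] > 1:
        sizes.append(-(-sizes[-1] // fanout))  # ceiling division
    return sizes


pools = initial_pool_count(1_000_000, 100_000)
schedule = pool_schedule(100_000, fanout=10)
rounds_of_subdivision = len(schedule) - 1
```

With the assumed tenfold subdivision, the library starts as ten
pools and a positive pool of 100,000 reaches a single clone after
five rounds (100,000, 10,000, 1,000, 100, 10, 1). A different
subdivision factor changes only the number of rounds.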

The above-described methods were used to
isolate a cDNA clone encoding a GluGR designated "subtype
1a." Oligonucleotide probes based on the sequence of the
subtype 1a clone were used to probe additional brain and
cerebellum cDNA libraries. These libraries yielded
clones encoding additional subtypes, which were
designated 1b, 2a and 2b.

With the GluGR and cDNA clones thereof provided
herein, nucleotide and amino acid sequences may be
determined by conventional means, such as by dideoxy
sequencing. See generally, Sambrook et al., Molecular
Cloning, A Laboratory Manual, 2d ed., Cold Spring Harbor
Laboratory Press, Cold Spring Harbor, NY, 1989,
incorporated by reference herein. Genomic or cDNA
sequences encoding GluGR and homologous receptors of this
family may be obtained from libraries prepared from other
mammalian species according to well known procedures.
For instance, using oligonucleotide probes from rodent
GluGR, such as whole length cDNA or shorter probes of at
least about fourteen nucleotides to twenty-five or more
nucleotides in length, often as many as 40 to 50
nucleotides, DNA sequences encoding GluGR of other
mammalian species, such as lagomorph, avian, bovine,
porcine, murine, etc. may be obtained. If partial clones
are obtained, it is necessary to join them in proper
reading frame to produce a full length clone, using such
techniques as endonuclease cleavage, ligation and loopout
mutagenesis.
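
The requirement that partial clones be joined in proper reading
frame can be checked computationally before any bench work. The
sketch below is illustrative only: the function names and the toy
sequences are hypothetical, the standard stop codons TAA, TAG and
TGA are assumed, and the check covers only the coding region
(ATG start through stop), not untranslated sequence.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def is_open_reading_frame(coding_seq: str) -> bool:
    """True if the sequence starts with ATG, is a whole number of codons,
    ends with a stop codon, and has no earlier in-frame stop codon."""
    seq = coding_seq.upper()
    if len(seq) < 6 or len(seq) % 3 != 0 or not seq.startswith("ATG"):
        return False
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[-1] not in STOP_CODONS:
        return False
    return not any(codon in STOP_CODONS for codon in codons[:-1])


def join_in_frame(five_prime_part: str, three_prime_part: str) -> str:
    """Concatenate two partial coding sequences, refusing a frame-shifted join."""
    joined = five_prime_part.upper() + three_prime_part.upper()
    if not is_open_reading_frame(joined):
        raise ValueError("joined sequence is not a single open reading frame")
    return joined


# Toy fragments (hypothetical, not taken from any GluGR clone)
full_length = join_in_frame("ATGGCT", "GAATAA")
```

A join that drops or adds a single nucleotide at the junction
changes the length so that it is no longer a multiple of three,
and the check rejects it.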

A DNA sequence encoding GluGR is inserted into a
suitable expression vector, which in turn is used to
transfect eukaryotic cells. Expression vectors for use
in carrying out the present invention will comprise a
promoter capable of directing the transcription of a
cloned DNA and a transcriptional terminator.

To direct proteins of the present invention for
transport to the plasma membrane, at least one signal
sequence is operably linked to the DNA sequence of
interest. The signal sequence may be derived from the
GluGR coding sequence, from other signal sequences
described in the art, or synthesized de novo.

Host cells for use in practicing the present
invention include mammalian, avian, plant, insect and
fungal cells, but preferably, mammalian cells. Fungal
cells, including species of yeast (e.g., Saccharomyces
spp., particularly S. cerevisiae, Schizosaccharomyces
spp.) or filamentous fungi (e.g., Aspergillus spp.,
Neurospora spp.) may be used as host cells within the
present invention. Suitable yeast vectors for use in the
present invention include YRp7 (Struhl et al., Proc.
Natl. Acad. Sci. USA 76:1035-1039, 1978), YEp13 (Broach
et al., Gene 8:121-133, 1979), POT vectors (Kawasaki et
al., U.S. Patent No. 4,931,373, which is incorporated by
reference herein), pJDB249 and pJDB219 (Beggs, Nature
275:104-108, 1978) and derivatives thereof. Such vectors
will generally include a selectable marker, which may be
one of any number of genes that exhibit a dominant
phenotype for which a phenotypic assay exists to enable
transformants to be selected. Preferred selectable
markers are those that complement host cell auxotrophy,
provide antibiotic resistance or enable a cell to utilize
specific carbon sources, and include LEU2 (Broach et al.,
ibid.), URA3 (Botstein et al., Gene 8:17, 1979), HIS3
(Struhl et al., ibid.) or POT1 (Kawasaki et al., ibid.).
Another suitable selectable marker is the CAT gene, which
confers chloramphenicol resistance on yeast cells.

Additional vectors, promoters and terminators
for use in expressing the receptor of the invention in
yeast are well known in the art and are reviewed by, for
example, Emr, Meth. Enzymol. 185:231-279 (1990),
incorporated herein by reference. The receptors of the
invention may be expressed in Aspergillus spp. (McKnight
and Upshall, described in U.S. Patent 4,935,349, which is
incorporated herein by reference). Useful promoters
include those derived from Aspergillus nidulans
glycolytic genes, such as the ADH3 promoter (McKnight et
al., EMBO J. 4:2093-2099, 1985) and the tpiA promoter.
An example of a suitable terminator is the ADH3
terminator (McKnight et al., ibid.). Techniques for
transforming fungi are well known in the literature, and
have been described, for instance by Beggs (ibid.),
Hinnen et al. (Proc. Natl. Acad. Sci. USA 75:1929-1933,
1978), Yelton et al. (Proc. Natl. Acad. Sci. USA 81:1740-
1747, 1984), and Russell (Nature 301:167-169, 1983), each
of which is incorporated herein by reference.

A variety of higher eukaryotic cells may serve
as host cells for expression of the GluGR, although not
all cell lines will be capable of functional coupling of
the receptor to the cell's second messenger systems.
Cultured mammalian cells, such as BHK, CHO, Y1 (Shapiro
et al., TIPS Suppl. 43-46 (1989)), NG108-15 (Dawson et
al., Neuroscience Approached Through Cell Culture, Vol.
2, pages 89-114 (1989)), N1E-115 (Liles et al., J. Biol.
Chem. 261:5307-5313 (1986)), PC 12 and COS-1 (ATCC CRL
1650) are preferred. Preferred BHK cell lines are the
tk- ts13 BHK cell line (Waechter and Baserga, Proc. Natl.
Acad. Sci. USA 79:1106-1110 (1982)) and the BHK 570 cell
line (deposited with the American Type Culture
Collection, 12301 Parklawn Dr., Rockville, MD, under
accession number CRL 10314). A tk- BHK cell line is
available from the ATCC under accession number CRL 1632.

Mammalian expression vectors for use in
carrying out the present invention will include a
promoter capable of directing the transcription of a
cloned gene or cDNA. Preferred promoters include viral
promoters and cellular promoters. Viral promoters
include the immediate early cytomegalovirus promoter
(Boshart et al., Cell 41:521-530, 1985) and the SV40
promoter (Subramani et al., Mol. Cell. Biol. 1:854-864,
1981). Cellular promoters include the mouse
metallothionein-1 promoter (Palmiter et al., U.S. Patent
No. 4,579,821), a mouse Vκ promoter (Bergman et al.,
Proc. Natl. Acad. Sci. USA 81:7041-7045, 1983; Grant et
al., Nuc. Acids Res. 15:5496, 1987) and a mouse VH
promoter (Loh et al., Cell 33:85-93, 1983). A
particularly preferred promoter is the major late
promoter from Adenovirus 2 (Kaufman and Sharp, Mol. Cell.
Biol. 2:1304-1319, 1982). Such expression vectors may
also contain a set of RNA splice sites located downstream
from the promoter and upstream from the DNA sequence
encoding the peptide or protein of interest. Preferred
RNA splice sites may be obtained from adenovirus and/or
immunoglobulin genes.

Also contained in the expression vectors is a
polyadenylation signal located downstream of the coding
sequence of interest. Polyadenylation signals include
the early or late polyadenylation signals from SV40
(Kaufman and Sharp, ibid.), the polyadenylation signal
from the Adenovirus 5 E1B region and the human growth
hormone gene terminator (DeNoto et al., Nuc. Acids Res.
9:3719-3730, 1981). The expression vectors may include
a noncoding viral leader sequence, such as the Adenovirus
2 tripartite leader, located between the promoter and the
RNA splice sites. Preferred vectors may also include
enhancer sequences, such as the SV40 enhancer and the
mouse μ enhancer (Gillies, Cell 33:717-728, 1983).
Expression vectors may also include sequences encoding
the adenovirus VA RNAs.

Cloned DNA sequences may be introduced into
cultured mammalian cells by, for example, calcium
phosphate-mediated transfection (Wigler et al., Cell 14:
725, 1978; Corsaro and Pearson, Somatic Cell Genetics 7:
603, 1981; Graham and Van der Eb, Virology 52:456,
1973). Other techniques for introducing cloned DNA
sequences into mammalian cells, such as electroporation
(Neumann et al., EMBO J. 1:841-845, 1982), may also be
used. In order to identify cells that have integrated
the cloned DNA, a selectable marker is generally
introduced into the cells along with the gene or cDNA of
interest. Preferred selectable markers for use in
cultured mammalian cells include genes that confer
resistance to drugs, such as neomycin, hygromycin, and
methotrexate. The selectable marker may be an
amplifiable selectable marker. Preferred amplifiable
selectable markers are the DHFR gene and the neomycin
resistance gene. Selectable markers are reviewed by
Thilly (Mammalian Cell Technology, Butterworth
Publishers, Stoneham, MA, which is incorporated herein by
reference). The choice of selectable markers is well
within the level of ordinary skill in the art.

Selectable markers may be introduced into the
cell on a separate plasmid at the same time as the gene
of interest, or they may be introduced on the same
plasmid. If on the same plasmid, the selectable marker
and the gene of interest may be under the control of
different promoters or the same promoter, the latter
arrangement producing a dicistronic message. Constructs
of this type are known in the art (for example, Levinson
and Simonsen, U.S. Patent No. 4,713,339). It may also be
advantageous to add additional DNA, known as "carrier
DNA," to the mixture which is introduced into the cells.

Transfected mammalian cells are allowed to grow
for a period of time, typically 1-2 days, to begin
expressing the DNA sequence(s) of interest. Drug
selection is then applied to select for growth of cells
that are expressing the selectable marker in a stable
fashion. Transfected cells may also be selected in the
presence of antagonist to inhibit the activity of the
receptor. Suitable antagonists in this context include
D,L-2-amino-3-phosphonopropionate. For cells that have
been transfected with an amplifiable selectable marker
the drug concentration may be increased in a stepwise
manner to select for increased copy number of the cloned
sequences, thereby increasing expression levels.

Promoters, terminators and methods suitable for
introducing expression vectors encoding recombinant GluGR
into plant, avian and insect cells are known in the art.
The use of baculoviruses, for example, as vectors for
expressing heterologous DNA sequences in insect cells has
been reviewed by Atkinson et al. (Pestic. Sci. 28:
215-224, 1990). The use of Agrobacterium rhizogenes as
vectors for expressing genes in plant cells has been
reviewed by Sinkar et al. (J. Biosci. (Bangalore) 11:
47-58, 1987).

Host cells containing DNA constructs of the
present invention are then cultured to produce
recombinant GluGR. The cells are cultured according to
accepted methods in a culture medium containing nutrients
required for growth of mammalian or other host cells. A
variety of suitable media are known in the art and
generally include a carbon source, a nitrogen source,
essential amino acids, vitamins, minerals and growth
factors. The growth medium will generally select for
cells containing the DNA construct by, for example, drug
selection or deficiency in an essential nutrient which is
complemented by the selectable marker on the DNA
construct or co-transfected with the DNA construct.

Transfected cells expressing a cloned GluGR can
be detected by several methods. By transfecting cells
with an expression vector containing expression units for
both the GluGR and a reporter gene (e.g. luciferase), the
activity of the reporter gene provides an indicator of
expression of the cotransfected GluGR clone. By including
one or more cyclic AMP response elements (CRE) in the
reporter gene expression unit, clones encoding receptors
coupled to either the stimulation or inhibition of the
second messenger adenylate cyclase can be identified by a
change in reporter gene expression in response to added
ligand. DNA constructs comprising a linked CRE and
reporter gene are known in the art. See, for example,
Mellon et al., Proc. Natl. Acad. Sci. USA 86: 4887-4891
(1989), incorporated herein by reference. Cell lines
expressing functional receptors can also be detected by
electrophysiological measurements of agonist-induced
channel activity. Receptor activity can also be assayed
by measuring cytosolic free calcium concentrations in
transfected cells. See, for example, Thastrup et al.,
Proc. Natl. Acad. Sci. USA 87: 2466-2470 (1990) and
Picard et al., Science 247: 327-329 (1990), which are
incorporated herein by reference. A preferred method for
measuring cytosolic free calcium is by scanning cells
with a fluorescent microscope coupled to a video camera.
The cells are injected with a fluorescent Ca2+ indicator
(e.g. Fluo-3 or Fura-2, Molecular Probes, Inc., Eugene,
OR) and exposed to ligand.
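
By way of illustration only, fluorescence from a ratiometric
indicator such as Fura-2 is commonly converted to a free
calcium estimate with the ratio calibration equation
[Ca2+] = Kd x beta x (R - Rmin) / (Rmax - R). The sketch below
applies that equation. The function name and every numeric
calibration value are assumptions chosen for the example and
are not taken from the procedures described herein.

```python
def calcium_from_ratio(r, r_min, r_max, kd_nM, beta):
    """Estimate free Ca2+ (nM) from a dual-excitation fluorescence ratio.

    Implements [Ca2+] = Kd * beta * (R - Rmin) / (Rmax - R).
    r_min and r_max are the ratios at zero and saturating Ca2+;
    beta is the free/bound fluorescence ratio at the second wavelength.
    """
    if not r_min <= r < r_max:
        raise ValueError("ratio must satisfy r_min <= r < r_max")
    return kd_nM * beta * (r - r_min) / (r_max - r)


# Hypothetical calibration constants, for illustration only.
CAL = dict(r_min=0.5, r_max=5.0, kd_nM=224.0, beta=4.0)

resting = calcium_from_ratio(0.8, **CAL)      # ratio before ligand
stimulated = calcium_from_ratio(2.0, **CAL)   # ratio after ligand
print(f"resting {resting:.0f} nM, stimulated {stimulated:.0f} nM")
```

In practice the calibration constants would be determined for
the particular microscope, indicator batch and cell type.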

The GluGR produced according to the present
invention may be purified from the recombinant expression
systems or other sources using purification protocols
that employ techniques generally available to those
skilled in the art. The most convenient sources for
obtaining large quantities of GluGR are cells which
express the recombinant receptor. However, other
sources, such as tissues, particularly brain tissues of
the cerebellum which contain GluGR, may also be employed.

Purification may be achieved by conventional
chemical purification means, such as liquid
chromatography, lectin affinity chromatography, gradient
centrifugation, and gel electrophoresis, among others.
Methods of protein purification are known in the art (see
generally, Scopes, R., Protein Purification,
Springer-Verlag, NY (1982), which is incorporated herein
by reference) and may be applied to the purification of
the GluGR and particularly the recombinantly produced
GluGR described herein. In a preferred embodiment
immunoaffinity chromatography is employed using
antibodies directed against GluGR as herein described. In
another method of purification, a recombinant gene
encoding GluGR or portions thereof can be modified at the
amino terminus, just behind a signal peptide, with a
sequence coding for a small hydrophilic peptide, such as
described in U.S. Patent Nos. 4,703,004 and 4,782,137,
incorporated herein by reference. Specific antibodies
for the peptide facilitate rapid purification of GluGR,
and the short peptide can then be removed with
enterokinase.

Thus, as discussed above, the present invention
provides GluGR isolated from its natural cellular
environment, substantially free of other G protein-coupled
glutamate receptors. Purified GluGR is also
provided. Substantially pure GluGR of at least about 50%
is preferred, at least about 70-80% more preferred, and
95-99% or more homogeneity most preferred, particularly
for pharmaceutical uses. Once purified, partially or to
homogeneity, as desired, the recombinant GluGR or native
GluGR may then be used to generate antibodies, in assay
procedures, etc.

In another aspect, the invention concerns
polypeptides and fragments of GluGR. Polypeptides and
fragments of GluGR may be isolated from recombinant
expression systems or may be synthesized by the solid
phase method of Merrifield, Fed. Proc. 21:412 (1962),
Merrifield, J. Am. Chem. Soc. 85:2149 (1963), or Barany
and Merrifield, in The Peptides, vol. 2, pp. 1-284 (1979)
Academic Press, NY, each of which is incorporated herein
by reference, or by use of an automated peptide
synthesizer. By "polypeptides" is meant a sequence of at
least about 3 amino acids, typically 6 or more, up to
100-200 amino acids or more, including entire proteins.
For example, the portion(s) of GluGR proteins which bind
ligand may be identified by a variety of methods, such as
by treating purified receptor with a protease or a
chemical agent to fragment it and determining which
fragment is able to bind to labeled glutamate in a ligand
blot. Polypeptides may then be synthesized and used as
antigen, to inhibit ligand-GluGR interaction, etc. It
should be understood that as used herein, reference to
GluGR is meant to include the proteins, polypeptides, and
fragments thereof unless the context indicates otherwise.



WO 92/10583 



PCT/US91/09422 



In another aspect, the invention provides means
for regulating the GluGR-ligand interaction, and thus
treating, therapeutically and/or prophylactically, a
disorder which can be linked directly or indirectly to a
GluGR or to its ligands, such as glutamate and other
endogenous excitatory amino acids. By virtue of having
the receptors of the invention, agonists or antagonists
may be identified which stimulate or inhibit the
interaction of ligand with a GluGR. With either agonists
or antagonists the metabolism and reactivity of cells
which express the receptor are controlled, thereby
providing a means to abate or in some instances prevent
the disease of interest.

Thus, the invention provides screening
procedures for identifying agonists or antagonists of
events mediated by the ligand-GluGR interaction. Such
screening assays may employ a wide variety of formats,
depending to some extent on which aspect of the
ligand/receptor/G protein interaction is targeted. For
example, such assays may be designed to identify
compounds which bind to the receptor and thereby block or
inhibit interaction of the receptor with the ligand.
Other assays can be designed to identify compounds which
can substitute for ligand and therefore stimulate
GluGR-mediated intracellular pathways. Yet other assays can be
used to identify compounds which inhibit or facilitate
the association of GluGR to G protein and thereby mediate
the cellular response to GluGR ligand.

In one functional screening assay, the
initiation of fertilization activation events is
monitored in eggs which have been injected with, e.g.,
mRNA which codes for GluGR and subsequently exposed to
selected compounds which are being screened, in
conjunction with or apart from an appropriate ligand.
See generally, Kline et al., Science 241:464-467 (1988),
incorporated herein by reference. Oocytes injected with
mRNA coding for GluGR can also be assayed by measurement
of free cytosolic Ca2+ as described above.

Another screening assay is based on the use of
mammalian cell lines which express GluGR functionally
coupled to a mammalian G protein. In this assay,
compounds are screened for their relative affinity as
receptor agonists or antagonists by comparing the
relative receptor occupancy to the extent of
ligand-induced stimulation or inhibition of second messenger
metabolism. For example, activation of phospholipase C
leads to increased inositol monophosphate metabolism.
Means for measuring inositol monophosphate metabolism are
generally described in Subers and Nathanson, J. Mol.
Cell. Cardiol. 20:131-140 (1988), incorporated herein by
reference. As noted previously, receptor subtypes that
are coupled to the stimulation or inhibition of the
second messenger adenylate cyclase can be used in assay
systems wherein reporter gene (e.g. luciferase) activity
is linked to receptor-ligand interactions.

The screening procedure can be used to identify 
reagents such as antibodies which specifically bind to 
the receptors and substantially affect their interaction 
with ligand, for example. The antibodies may be 
monoclonal or polyclonal, in the form of antiserum or 
monospecific antibodies, such as purified antiserum or 
monoclonal antibodies or mixtures thereof. For
administration to humans, e.g., as a component of a 
composition for in vivo diagnosis or imaging, the 
antibodies are preferably substantially human to minimize 
immunogenicity and are in substantially pure form. By 
substantially human is meant generally containing at 
least about 70% human antibody sequence, preferably at 
least about 80% human, and most preferably at least about 
90-95% or more of a human antibody sequence to minimize 
immunogenicity in humans. 

Antibodies which bind GluGR may be produced by a
variety of means. The production of non-human antisera
or monoclonal antibodies, e.g., murine, lagomorpha,
equine, etc. is well known and may be accomplished by,
for example, immunizing the animal with the receptor
molecule or a preparation containing a desired portion of
the receptor molecule, such as that domain or domains
which contribute to ligand binding. Receptor
subtype-specific antibodies can be generated by immunizing with
specific peptides. Small peptides (e.g., about 14-20
amino acids) can be coupled to keyhole limpet hemocyanin,
for example, to enhance immunogenicity. For the
production of monoclonal antibodies, antibody producing
cells obtained from immunized animals are immortalized
and screened, or screened first for the production of
antibody which binds to the receptor protein and then
immortalized. As the generation of human monoclonal
antibodies to human GluGR antigen may be difficult with
conventional techniques, it may be desirable to transfer
antigen binding regions of the non-human antibodies, e.g.
the F(ab')2 or hypervariable regions, to human constant
regions (Fc) or framework regions by recombinant DNA
techniques to produce substantially human molecules.
Such methods are generally known in the art and are
described in, for example, U.S. Patent No. 4,816,397 and
EP publications 173,494 and 239,400, which are
incorporated herein by reference. Alternatively, one may
isolate DNA sequences which code for a human monoclonal
antibody or portions thereof that specifically bind to
the human receptor protein by screening a DNA library
from human B cells according to the general protocol
outlined by Huse et al., Science 246:1275-1281 (1989),
incorporated herein by reference, and then cloning and
amplifying the sequences which encode the antibody (or
binding fragment) of the desired specificity.

In other embodiments, the invention provides
screening assays conducted in vitro with cells which
express the receptor. For example, the DNA which encodes
the receptor or selected portions thereof may be
transfected into an established cell line, e.g., a
mammalian cell line such as BHK or CHO, using procedures
known in the art (see, e.g., Sambrook et al., Molecular
Cloning, A Laboratory Manual, 2d ed., Cold Spring Harbor
Laboratory Press, Cold Spring Harbor, N.Y., 1989, which
is incorporated herein by reference). The receptor is
then expressed by the cultured cells, and selected agents
are screened for the desired effect on the cell,
separately or in conjunction with an appropriate ligand
such as glutamate or quisqualate. Means for amplifying
nucleic acid sequences which may be employed to amplify
sequences encoding the receptor or portions thereof are
described in U.S. Pat. Nos. 4,683,195 and 4,683,202,
incorporated herein by reference.

In yet another aspect, the screening assays
provided by the invention relate to transgenic mammals
whose germ cells and somatic cells contain a nucleotide
sequence encoding GluGR protein or a selected portion of
the receptor which, e.g., binds ligand, GTP binding
protein, or the like. There are several means by which a
sequence encoding, for example, the human GluGR may be
introduced into a non-human mammalian embryo, some of
which are described in, e.g., U.S. Patent No. 4,736,866,
Jaenisch, Science 240:1468-1474 (1988) and Westphal et
al., Annu. Rev. Cell Biol. 5:181-196 (1989), which are
incorporated herein by reference. The animal's cells
then express the receptor and thus may be used as a
convenient model for testing or screening selected
agonists or antagonists.

In another aspect the invention concerns
diagnostic methods and compositions. By means of having
the GluGR molecule and antibodies thereto, a variety of
diagnostic assays are provided. For example, with
antibodies, including monoclonal antibodies, to GluGR, the
presence and/or concentration of receptor in selected
cells or tissues in an individual or culture of interest
may be determined. These assays can be used in the
diagnosis and/or treatment of diseases such as, for
example, cerebral ischemia, Parkinson's, senile dementia
and other cognitive disorders, Huntington's chorea,
amyotrophic lateral sclerosis, emesis, migraine, and
others.

Numerous types of immunoassays are available
and are known to those skilled in the art, e.g.,
competitive assays, sandwich assays, and the like, as
generally described in, e.g., U.S. Pat. Nos. 4,642,285;
4,376,110; 4,016,043; 3,879,262; 3,852,157; 3,850,752;
3,839,153; 3,791,932; and Harlow and Lane, Antibodies, A
Laboratory Manual, Cold Spring Harbor Publications, N.Y.
(1988), each incorporated by reference herein. In one
assay format GluGR is identified and/or quantified by
using labeled antibodies, preferably monoclonal
antibodies, which are reacted with brain tissues, e.g.,
cortex, striatum, hippocampus, cerebellum, and
determining the specific binding thereto, the assay
typically being performed under conditions conducive to
immune complex formation. Unlabeled primary antibody can
be used in combination with labels that are reactive with
primary antibody to detect the receptor. For example,
the primary antibody may be detected indirectly by a
labeled secondary antibody made to specifically detect
the primary antibody. Alternatively, the anti-GluGR
antibody can be directly labeled. A wide variety of
labels may be employed, such as radionuclides, particles
(e.g., gold, ferritin, magnetic particles, red blood
cells), fluorophores, chemiluminescers, enzymes, enzyme
substrates, enzyme cofactors, enzyme inhibitors, ligands
(particularly haptens), etc.

The GluGR DNA may be directly detected in cells
with a labeled GluGR DNA or synthetic oligonucleotide
probe in a hybridization procedure similar to the
Southern or dot blot. Also, the polymerase chain
reaction (Saiki et al., Science 239:487 (1988), and U.S.
Pat. No. 4,683,195) may be used to amplify DNA sequences,
which are subsequently detected by their characteristic
size on agarose gels, Southern blots of these gels using
GluGR DNA or an oligonucleotide probe, or a dot blot using
similar probes. The probes may comprise from about 14
nucleotides to about 25 or more nucleotides, preferably
40 to 60 nucleotides, and in some instances a substantial
portion or even the entire cDNA of GluGR may be used. The
probes are labeled with a detectable signal, such as an
enzyme, biotin, a radionuclide, fluorophore,
chemiluminescer, paramagnetic particle, etc.

Kits can also be supplied for use with the
receptor of the subject invention in the detection of the
presence of the receptor or antibodies thereto, as might
be desired in the case of autoimmune disease. Thus,
antibodies to GluGR, preferably monospecific antibodies
such as monoclonal antibodies, or compositions of the
receptor may be provided, usually in lyophilized form in
a container, either segregated or in conjunction with
additional reagents, such as anti-antibodies, labels,
gene probes, polymerase chain reaction primers and
polymerase, and the like.

The following examples are offered by way of
illustration, not by limitation.



EXAMPLE I

Preparation of GluGR enriched mRNA

Total RNA was prepared from the cerebellum of
rats using guanidine isothiocyanate (Chirgwin et al.,
Biochemistry 18:5294 (1979)) and CsCl centrifugation.
Poly(A)+ RNA was isolated using oligo d(T) cellulose
chromatography. After 2 rounds of chromatography on
oligo d(T) cellulose the RNA (800 µg) was divided into
two aliquots and layered over 10-40% linear sucrose
gradients in tubes for an SW 28 rotor. The gradients
were centrifuged for 28 hours at 25,000 rpm to pellet RNA
greater than 4 kb in size. The enriched RNA was injected
into frog oocytes and assayed for the presence of the
GluGR.
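
For readers who wish to relate the stated rotor speed to a
centrifugal field, the following sketch applies the standard
conversion RCF = 1.118 x 10^-5 x r x rpm^2, with r in
centimeters. The maximum radius used for the SW 28 rotor
(16.1 cm) is an assumption taken to be the manufacturer's
nominal value and should be checked against the rotor manual;
it does not appear in the procedure above.

```python
def relative_centrifugal_force(rpm, radius_cm):
    """Return the relative centrifugal force (multiples of g).

    Uses RCF = 1.118e-5 * r * rpm**2 with r in centimeters.
    """
    return 1.118e-5 * radius_cm * rpm ** 2


# Assumed maximum radius of the swinging-bucket rotor, in cm.
R_MAX_CM = 16.1

rcf_bottom = relative_centrifugal_force(25_000, R_MAX_CM)
print(f"about {rcf_bottom:,.0f} x g at the bottom of the tube")
```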

Injection of oocytes and voltage-clamp assay of GluGR
activity

Oocytes were prepared from ovarian lobes that
were surgically removed from anesthetized Xenopus
females. The ovarian lobes were washed, pulled apart
into small clumps and dissociated by treatment with
collagenase for 2-3 hours at 20°C with constant, gentle
agitation. The dissociation and defolliculation of
the oocytes was completed manually after removal of the
collagenase. Oocytes that were judged healthy and
greater than 1 mm in diameter were transferred to a 50 mm
sterile tissue culture dish and incubated in sterile,
antibiotic-supplemented Barth's medium (88 mM NaCl, 1 mM
KCl, 0.82 mM MgSO4, 0.33 mM Ca(NO3)2, 0.41 mM CaCl2, 2.4 mM
NaHCO3, 10 mM HEPES, pH 7.4, 0.1 mg/ml gentamicin, 0.01
mg/ml penicillin, 0.01 mg/ml streptomycin, 0.5 mM
theophylline, and 2.5 mM Na pyruvate) at 19°C.
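
The millimolar concentrations given for Barth's medium can be
converted to weighed amounts as sketched below. Only four of
the components are shown. The molar masses are standard values
for the anhydrous compounds, which is an assumption: if a
hydrated salt is used, its own molar mass must be substituted.
The function and dictionary names are hypothetical.

```python
# Molar masses in g/mol (anhydrous forms assumed).
MOLAR_MASS = {
    "NaCl": 58.44,
    "KCl": 74.55,
    "NaHCO3": 84.01,
    "HEPES": 238.30,
}

# Concentrations in mM as stated for Barth's medium.
BARTH_MM = {"NaCl": 88.0, "KCl": 1.0, "NaHCO3": 2.4, "HEPES": 10.0}


def grams_needed(conc_mM, molar_mass, volume_l):
    """Mass in grams giving conc_mM in volume_l liters."""
    return conc_mM / 1000.0 * molar_mass * volume_l


# Amounts for one liter of medium.
recipe = {
    name: grams_needed(conc, MOLAR_MASS[name], 1.0)
    for name, conc in BARTH_MM.items()
}
for name, grams in recipe.items():
    print(f"{name}: {grams:.3f} g per liter")
```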

Injection pipettes were pulled from hard glass
tubing (Drummond) on a modified 700C Kopf vertical
puller. The tip was broken and bevelled using a List
Medical microforge. Tip diameters of the pipettes ranged
from 20-30 µm. Injection pipettes were made RNase free
by heating to 285°C overnight.

Following overnight incubation, healthy oocytes
were selected for injection. RNA, which was stored at
-70°C in DEPC-treated water, was thawed and centrifuged
at 15,000 g for five minutes. Injection was performed
using a modified pipetting device (Drummond). After
injection, the oocytes were incubated in fresh, sterile
Barth's medium which was changed daily, and unhealthy
oocytes were removed.

Voltage-clamp assays were carried out on
injected oocytes which were each placed in a small
chamber of approximately 500 µl in volume and which was
continuously perfused with standard frog Ringer's (115 mM
NaCl, 2.5 mM KCl, 1.8 mM CaCl2, 10 mM HEPES, pH 7.2) at
1-6 ml/min. The oocyte was impaled with two glass
microelectrodes for recording which, when filled with 3 M
KCl, had a tip resistance of 0.5 to 7.0 megaohms. One of
the two electrodes was connected to a differential
amplifier via a silver/silver chloride half cell. The
bath potential was measured by connecting the other side
of the differential amplifier to the bath via a
silver/silver chloride pellet and a Ringer/Agar bridge.
A low noise, high compliance, voltage-clamp system (NPI)
was used to control the membrane potential and to measure
membrane current. The oocyte membrane potential was
maintained at -60 mV (inside cell negative). One
millimolar glutamate (Sigma), 100 µM quisqualate (Sigma),
1 mM carbamylcholine (Sigma), and the other drugs used in
this assay were applied by switching the perfusing medium
to a medium containing a drug for approximately three
minutes, and the membrane current was recorded on a chart
recorder (Linear Instruments).

After impaling the oocyte with the two 
microelectrodes, and imposing the voltage-clamp, the 
membrane current (the holding current) gradually declines 
to a steady state over a period of several minutes. When 
the holding current stabilizes, so that the chart record 
is horizontal, the drug is applied for one to three 
minutes. An oocyte is judged to have a positive response 
if a rapid inward current spike (downward deflection on 
the chart) , followed by slow current oscillations of 
decreasing magnitude, is observed. Our lower limit of 
detection depended on the steadiness of the holding 
current prior to drug application, but was in the range 
of 5<0 nA.
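
The scoring rule described above can be sketched in code as
follows. This is a simplified illustration, not the procedure
actually used: it checks only that the holding current was
steady before drug application and that an inward (negative
going) spike exceeded a chosen threshold, and it does not test
for the subsequent oscillations. The function names, sample
traces and threshold values are all hypothetical.

```python
def holding_current_is_steady(samples_nA, tolerance_nA):
    """True when the pre-drug holding current spans no more than tolerance_nA."""
    return max(samples_nA) - min(samples_nA) <= tolerance_nA


def has_inward_spike(baseline_nA, response_nA, threshold_nA):
    """True when the response dips below the mean baseline by threshold_nA or more.

    Inward current is negative going, i.e. a downward chart deflection.
    """
    mean_baseline = sum(baseline_nA) / len(baseline_nA)
    return mean_baseline - min(response_nA) >= threshold_nA


# Hypothetical membrane current samples in nA.
baseline = [-20.0, -21.0, -20.5, -20.0]
response = [-20.0, -180.0, -60.0, -90.0, -40.0, -55.0, -25.0]

steady = holding_current_is_steady(baseline, tolerance_nA=2.0)
positive = steady and has_inward_spike(baseline, response, threshold_nA=10.0)
print("steady baseline:", steady, "| positive response:", positive)
```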

Construction of pVEGT'

To permit transcription of cloned cDNA without
prior endonuclease digestion, bacteriophage T7
transcriptional terminators were added to a cloning
vector. Plasmid pVEGT' is described in copending
U.S. S.N. 07/581,342, which is incorporated by reference
herein. The sequence of the putative T7 RNA
transcription terminator, which lies between gene 10 and
gene 11 of bacteriophage T7, is disclosed by Dunn and
Studier (J. Mol. Biol. 166: 477-536 (1983)). As shown in
Figure 5, four synthetic oligonucleotides were designed
from this sequence and ligated into the vector pGEM-1
(obtained from Promega Biotec, Madison, WI), a plasmid
containing a bacterial origin of replication, ampicillin
resistance gene, and the T7 promoter adjacent to a
multiple cloning site. Terminal phosphates were added to
the 5' ends of oligonucleotides ZC776 and ZC777 (Sequence
ID Nos. 4 and 5) with T4 polynucleotide kinase and ATP,
under standard conditions (Maniatis et al., ibid.). (The
sequences of these and other oligonucleotides referred to
herein are shown in Table 1.) After the incubation, the
kinase was heat killed at 65°C for 10 min. Twenty-five
nanograms of oligonucleotide ZC775 (Sequence ID Number 3)
and 25 ng of oligonucleotide ZC776 (Sequence ID Number 4)
were annealed by incubation at 65°C for 15 minutes, then
allowed to cool to room temperature in 500 ml of water.
Oligonucleotides ZC777 and ZC778 (Sequence ID Nos. 5 and
6) were similarly annealed. The annealed
oligonucleotides were stored at -20°C until use. The
vector pGEM-1 was digested with Pst I and Hind III, and
the linearized vector DNA was purified by agarose gel
electrophoresis. The synthetic T7 terminator (annealed
oligonucleotides ZC775, ZC776, ZC777 and ZC778; Sequence
ID Nos. 3, 4, 5 and 6) was then cloned into pGEM-1.
Twenty-five nanograms of vector plus an equimolar
amount of each of the annealed oligonucleotides
ZC775/ZC776 (Sequence ID Nos. 3 and 4) and ZC777/ZC778
(Sequence ID Nos. 5 and 6) were combined in a 10 µl
reaction mix. After an overnight ligation at 14°C, the
DNA was transformed into competent E. coli JM83 cells,
and the transformed cells were selected for ampicillin
resistance. Plasmid DNA was prepared from selected
transformants by the alkaline lysis procedure (Birnboim
and Doly, Nuc. Acids Res. 7:1513-1523 (1979)). A portion
of the DNA from these samples was cut with Pst I and Hind
III and analyzed on a 4% polyacrylamide gel to identify
clones that released an 80 bp Pst I-Hind III fragment.
Other diagnostic cuts, such as Eco RI and Not I, were
also made. One of the isolates, designated pGEMT, was
shown by restriction analysis to contain the T7
terminator fragment.
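
As a check on the design described above, the four terminator
oligonucleotides can be joined in silico to confirm that they
form one duplex with single-stranded ends suited to a Pst
I-Hind III digested vector. The sketch below uses the Table 1
sequences written without spaces. The assignment of ZC775 and
ZC777 to one strand and ZC778 and ZC776 to the other is an
inference from their complementarity, and the helper function
is hypothetical.

```python
def revcomp(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


# Table 1 sequences, 5' to 3'.
ZC775 = "GCTAGCATAACCCCTTGGGGCCTCTAAACGGGTCT"
ZC776 = "CTCAAGACCCGTTTAGAGGCCCCAAGGGGTTATGCTAGCTGCA"
ZC777 = "TGAGGGGTTTTTTGCTGAAAGGAGGAACTATGCGGCCGCA"
ZC778 = "AGCTTGCGGCCGCATAGTTCCTCCTTTCAGCAAAAAACCC"

top = ZC775 + ZC777        # one strand of the assembled terminator
bottom = ZC778 + ZC776     # the opposite strand

# The bottom strand should be the reverse complement of the top
# strand, extended by AGCT at its 5' end (pairs with a Hind III
# end) and by TGCA at its 3' end (pairs with a Pst I end).
duplex_ok = bottom == "AGCT" + revcomp(top) + "TGCA"

print(f"top {len(top)} nt, bottom {len(bottom)} nt, consistent: {duplex_ok}")
```

The assembled insert is thus close to the 80 bp Pst I-Hind III
fragment used above to identify correct clones.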

Table 1

Oligonucleotide Sequences (5' - 3')

ZC775 (Sequence ID Number 3):
GCT AGC ATA ACC CCT TGG GGC CTC TAA ACG GGT CT

ZC776 (Sequence ID Number 4):
CTC AAG ACC CGT TTA GAG GCC CCA AGG GGT TAT GCT AGC TGC A

ZC777 (Sequence ID Number 5):
TGA GGG GTT TTT TGC TGA AAG GAG GAA CTA TGC GGC CGC A

ZC778 (Sequence ID Number 6):
AGC TTG CGG CCG CAT AGT TCC TCC TTT CAG CAA AAA ACC C

ZC1751 (Sequence ID Number 7):
AAT TCT GTG CTC TGT CAA G

ZC1752 (Sequence ID Number 8):
GAT CCT TGA CAG AGC ACA G

ZC2063 (Sequence ID Number 9):
GAT CCA AAC TAG TAA AAG AGC T

ZC2064 (Sequence ID Number 10):
CTT TTA CTA GTT TG

ZC2938 (Sequence ID Number 11):
GAC AGA GCA CAG ATT CAC TAG TGA GCT CTT TTT TTT TTT TTT T

ZC3015 (Sequence ID Number 12):
TTC CAT GGC ACC GTC AAG GCT

ZC3016 (Sequence ID Number 13):
AGT GAT GGC ATG GAC TGT GGT

ZC3652 (Sequence ID Number 14):
ACA TGC ACC ATG CTC TGT GT

ZC3654 (Sequence ID Number 15):
AGT GAT GGC ATG GAC TGT GGT

The native T7 terminator from plasmid pAR2529
(Rosenberg et al., Gene 56:125-135 (1987)) was added to
plasmid pGEMT. Plasmid pGEMT was digested with Bam HI and
plasmid pAR2529 was digested with Bam HI and Bgl II
(Figure 1). The Bam HI-Bgl II terminator fragment from
pAR2529 was purified by agarose gel electrophoresis. The
terminator fragment was ligated to Bam HI digested pGEMT,
and the DNA was transformed into competent E. coli LM1035
cells. Colonies that were ampicillin resistant were
inoculated into 5 ml cultures for overnight growth.
Plasmid DNA prepared by the alkaline lysis procedure was
screened for proper terminator orientation by Bam HI-Sal
I digestion and electrophoresis on an 8% polyacrylamide
gel. A clone that contained the terminator in the
correct orientation, as evidenced by the presence of a
130 bp Bam HI-Sal I fragment, was chosen and named pGEMTT
(Figure 1).

To allow pGEMTT to be packaged as single-stranded
DNA in the presence of M13 phage proteins, the M13
intergenic region from pUC382 (similar to pUC118 and 119
as disclosed by Vieira and Messing, Methods Enzymol. 153:
3-11 (1987)) was added to pGEMTT (Figure 1). Plasmid
pGEMTT was digested with Fsp I and Nar I, and the
fragment containing the T7 promoter and transcription
terminator was purified. Plasmid pUC382 was digested
with Fsp I and Nar I, and the fragment encoding the
ampicillin resistance gene and the M13 intergenic region
was gel purified. These fragments were then ligated
together in the presence of T4 DNA ligase. The ligated
DNA was transformed into competent E. coli LM1035 cells.
Plasmid DNA from twelve ampicillin-resistant colonies was
prepared by the alkaline lysis method, and the DNA was
screened by digestion with Ava I. The appropriate
construction gave two bands, one of 2430 bp and another
of 709 bp. One such isolate was chosen and named pVEG.

Synthetic oligonucleotides encoding the prime sequence
were added to pVEG between the Bam HI and Eco RI sites
(Figure 1). Plasmid pVEG was digested with Bam HI and
Eco RI and the vector fragment was gel purified.
Ninety-six nanograms each of oligonucleotides ZC1751 and
ZC1752 (Sequence ID Nos. 7 and 8) were annealed in 4.5 µl
of 10 mM Tris pH 7.5, 20 mM MgCl2 and 10 mM NaCl at 65°C
for 20 minutes, then the mixture was cooled to room
temperature over a period of 30 minutes. The annealed
oligonucleotides were ligated to the pVEG vector fragment
with T4 DNA ligase and then transformed into competent E.
coli LM1035 cells. After growing overnight to develop
the colonies, a filter lift was taken of the colonies on
the agar plate. The filter was probed with 32P-labeled
oligonucleotide ZC1751 (Sequence ID Number 7). All of
the colonies were positive. Plasmid DNA was prepared
from cultures grown from 12 of the colonies. The plasmid
DNA was screened by digestion with Sst I to verify the
absence of the Sst I site between the Eco RI and Bam HI
sites of pVEG. All 12 of the plasmid DNAs were negative
for Sst I digestion. One of these 12 isolates was chosen
and named pVEG'.

A polyadenylate sequence derived from an
Aspergillus alcohol dehydrogenase cDNA was added to pVEG.
As shown in Figure 1, plasmid pM098 (disclosed in
published European patent application EP 272,277 and
deposited with American Type Culture Collection under
accession number 53428) was digested with Dra I and Bam
HI, and the approximately 150 bp poly(A) fragment was
purified by agarose gel electrophoresis. This fragment
contained mostly poly(A) sequence with very little
flanking cDNA. To clone the poly(A) cDNA fragment into
pVEG, pVEG was digested with Bam HI and Sma I, and the
3.4 kb vector fragment was gel purified. The vector and
poly(A) fragments were ligated together with T4 DNA
ligase to produce vector pVEGT (Figure 1).

Synthetic oligonucleotides encoding the prime
sequence were added to pVEGT. To accomplish this, pVEGT
was digested with Not I and Sst I, and the 370 bp
fragment containing the poly(A) sequence and the two T7
transcriptional terminators was purified by agarose gel
electrophoresis. Plasmid pVEG' was digested with Not I
and Bam HI, and the 3.2 kb vector fragment was
gel-purified. Two oligonucleotides (ZC2063 and ZC2064;
Sequence ID Nos. 9 and 10) that formed, when annealed, a
Bam HI-Sst I adapter were synthesized. The two
oligonucleotides were individually kinased and annealed,
and ligated with the linearized vector and the
poly(A)-terminator fragment. The resultant vector,
designated pVEGT' (Figure 1), contained a T7 RNA
transcription promoter, an Eco RI cloning site flanked by
the prime sequence, a poly(A) tract, and two T7 RNA
polymerase terminators.
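
The single-stranded ends produced when two such
oligonucleotides are annealed can be worked out as sketched
below. The approach, which locates the longest stretch shared
by one strand and the reverse complement of the other, is an
illustrative simplification and the function names are
hypothetical. The sequences are those of Table 1 written
without spaces.

```python
def revcomp(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def longest_common_substring(a, b):
    """Longest contiguous stretch of a that also occurs in b."""
    best = ""
    for i in range(len(a)):
        for j in range(i + len(best) + 1, len(a) + 1):
            if a[i:j] in b:
                best = a[i:j]
            else:
                break
    return best


def describe_duplex(top, bottom):
    """Report the paired core and unpaired ends of an annealed pair (5' to 3')."""
    rc_bottom = revcomp(bottom)
    core = longest_common_substring(top, rc_bottom)
    t = top.index(core)
    b = rc_bottom.index(core)
    return {
        "duplex_bp": len(core),
        "top_5p_overhang": top[:t],
        "top_3p_overhang": top[t + len(core):],
        "bottom_5p_overhang": revcomp(rc_bottom[b + len(core):]),
        "bottom_3p_overhang": revcomp(rc_bottom[:b]),
    }


# ZC2063/ZC2064 adapter and ZC1751/ZC1752 prime sequence oligonucleotides.
adapter = describe_duplex("GATCCAAACTAGTAAAAGAGCT", "CTTTTACTAGTTTG")
prime = describe_duplex("AATTCTGTGCTCTGTCAAG", "GATCCTTGACAGAGCACAG")
print(adapter)
print(prime)
```

A 5' GATC extension pairs with a Bam HI end, a 3' AGCT
extension with an Sst I end, and a 5' AATT extension with an
Eco RI end, which agrees with the cloning sites named in the
text.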



Construction of cDNA library from rat cerebellum poly(A)+ RNA

Because there was evidence suggesting that the
GluGR was encoded by a very large mRNA of 7 kb (Fong,
Davidson, and Lester, Synapse 2:657 (1988)) and because
full length cDNA encompassing the coding sequence is
required for functional cloning of cDNA, measures were
taken to optimize for synthesis of large cDNA. A novel
method of cDNA synthesis was developed which yielded
large full length cDNA. This was evidenced by the
demonstration that full length 7.5 kb cDNA could be
synthesized from a model 7.5 kb mRNA and that large full
length cDNAs were present in a library constructed from
poly(A)+ RNA as demonstrated by Southern blot analysis.
In addition, all enzymes which were important in this
method were pretested and selected from a large number of
lots of enzymes available from commercial suppliers.
Once a satisfactory lot was identified, a large amount of
the enzyme was purchased and the enzyme was stored at
-70°C until used. Once used, the enzyme was stored at
-20°C for a few months and then discarded. Different
"lots" of enzymes from commercial suppliers, including
lots of Superscript reverse transcriptase (BRL), E. coli
DNA polymerase I (Amersham) and Mung bean nuclease (NEB),
which were used in the cDNA synthesis, were screened for
quality in test synthesis assays. Superscript reverse
transcriptase lots were assayed for the ability to
synthesize unit length (7.5 kb) first strand cDNA from
7.5 kb RNA (BRL) control. Conditions for first strand
synthesis with Superscript reverse transcriptase lots
were prepared as described below. Radiolabeled first
strand cDNA was analyzed by alkaline agarose gel
electrophoresis. Superscript lots capable of producing
unit length, 7.5 kb cDNA were selected for use.

E. coli DNA polymerase I lots were assayed for the
ability to produce, by hairpin DNA formation, full-length
second strand cDNA from the 7.5 kb unit-length first
strand cDNA. The second strand cDNA syntheses were
carried out as described below. The quality of the
second strand syntheses was assessed by alkaline agarose
electrophoresis of the radiolabeled product. DNA
polymerase I lots capable of producing 15 kb second
strand DNA from the 7.5 kb unit length first strand cDNA
were selected for use.

Mung bean nuclease lots were tested for the 
ability to clip the hairpin DNA formed during second 
strand synthesis without degrading the cDNA. In 
addition, varying concentrations of enzyme were added to 
determine the optimum enzyme concentration for the 
conditions set forth below. The reactions were assessed 
by alkaline agarose electrophoresis. Lots and
concentrations resulting in the production of 7.5 kb unit 
length cDNA were selected for use. 

Total RNA was prepared from rat cerebella using 
guanidine isothiocyanate (Chirgwin et al., Biochemistry
18:5294 (1979)) and CsCl centrifugation (Glisin et al.,
Biochemistry 13:2633-2637 (1974)). Poly(A)+ RNA was
selected from the total RNA using oligo d(T) cellulose
chromatography (Aviv and Leder, Proc. Natl. Acad. Sci.
USA 69:1408 (1972)).

First strand cDNA was synthesized from one time
poly d(T)-selected cerebellum poly(A)+ RNA in two
separate reactions. One reaction, containing
radiolabeled dATP, was used to assess the quality of
first strand synthesis. The second reaction was carried
out in the absence of radiolabeled dATP and was used, in
part, to assess the quality of second strand synthesis.
Superscript reverse transcriptase (BRL) was used
specifically as described below. A 2.5x reaction mix was
prepared at room temperature by mixing, in order, 10 µl
of 5x reverse transcriptase buffer (BRL; 250 mM Tris-HCl
pH 8.3, 375 mM KCl, and 15 mM MgCl2), 2.5 µl of 200 mM
dithiothreitol (made fresh or stored in aliquots at
-70°C) and 2.5 µl of a deoxynucleotide triphosphate
solution containing 10 mM each of dATP, dGTP, dTTP and
5-methyl dCTP (Pharmacia). The reaction mix was
aliquoted into two tubes of 7.5 µl each. To the first
tube, 1.3 µl of 10 µCi/µl α-32P-dATP (Amersham) was added
and 1.3 µl of water was added to the second reaction
tube. Seven microliters from each tube was transferred
to reaction tubes. Fourteen microliters of a solution
containing 10 µg of cerebellum poly(A)+ RNA diluted in 14
µl of 5 mM Tris-HCl pH 7.4, 50 µM EDTA was mixed with 2
µl of 1 µg/µl first strand primer, ZC2938 (Table 1;
Sequence ID No. 11), and the primer was annealed to the
RNA by heating the mixture to 65°C for 4 minutes,
followed by chilling in ice water. Eight microliters of
the RNA-primer mixture was added to each of the two
reaction tubes followed by 5 µl of 200 U/µl Superscript
reverse transcriptase (BRL). The reactions were mixed
gently, and the tubes were incubated at 45°C for 30
minutes. After incubation, 80 µl of 10 mM Tris-HCl pH
7.4, 1 mM EDTA was added to each tube, and the samples
were vortexed and centrifuged briefly. Three microliters
of each reaction was removed to determine total counts and
TCA precipitable counts (incorporated counts). Two
microliters of each sample was analyzed by alkaline gel
electrophoresis to assess the quality of first strand
synthesis. The remainder of each sample was ethanol
precipitated. The nucleic acids were pelleted by
centrifugation, washed with 80% ethanol and air dried for
ten minutes. The first strand synthesis yielded 1.4 µg
of cerebellum cDNA or a 28% conversion of RNA into DNA.
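
The 28% figure can be checked with simple arithmetic. The sketch below is
only an illustration: it assumes that the RNA input was 10 µg in total,
that the 16 µl RNA-primer mixture was split evenly (8 µl per tube), and
that the 1.4 µg yield refers to a single reaction. The function name is
hypothetical.

```python
def percent_conversion(cdna_yield_ug: float, rna_input_ug: float) -> float:
    """Return the mass of cDNA recovered as a percentage of the RNA input."""
    if rna_input_ug <= 0:
        raise ValueError("RNA input must be positive")
    return 100.0 * cdna_yield_ug / rna_input_ug


# Assumed inputs: 10 ug poly(A)+ RNA in 14 ul plus 2 ul primer = 16 ul,
# of which 8 ul (half) goes into each reaction tube.
rna_per_tube_ug = 10.0 * 8.0 / 16.0
conversion = percent_conversion(1.4, rna_per_tube_ug)
print(f"{rna_per_tube_ug:.1f} ug RNA per tube, {conversion:.0f}% conversion")
```

Under these assumptions each tube receives 5 µg of RNA, and a 1.4 µg
yield corresponds to the stated 28% conversion.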

Second strand cDNA synthesis was performed on the 
RNA-DNA hybrid from the first strand reactions under 
conditions which encouraged first strand priming of 
second strand synthesis resulting in DNA hairpin 
formation. The nucleic acid pellets containing the first 
strand cDNA were resuspended in 71 µl of water. To
assess the quality of second strand synthesis, α-32P-dATP
was added to the unlabeled first strand cDNA. To
encourage formation of the hairpin structure, all
reagents except the enzymes were brought to room
temperature, and the reaction mixtures were set up at
room temperature. (Alternatively, the reagents can be on
ice and the reaction mixture set up at room temperature
and allowed to equilibrate at room temperature for a
short time prior to incubation at 16°C.) Two reaction
tubes were set up for each synthesis. One reaction tube
contained the unlabeled first strand cDNA and the other
reaction tube contained the radiolabeled first strand
cDNA. To each reaction tube were added 20 µl of 5x second
strand buffer (100 mM Tris, pH 7.4, 450 mM KCl, 23 mM
MgCl2, 50 mM (NH4)2SO4), 3 µl of beta-NAD, 1 µl of a
deoxynucleotide triphosphate solution containing 10 mM
each of dATP, dGTP, dTTP and dCTP (Pharmacia), 1 µl
α-32P-dATP or 1 µl of water (the radiolabeled dATP was
added to the tube containing the unlabeled first strand
cDNA), 0.6 µl of 7 U/µl E. coli DNA ligase
(Boehringer-Mannheim), 3.1 µl of 8 U/µl E. coli DNA
polymerase I (Amersham), and 1 µl of 2 U/µl RNase H
(BRL). The reactions were incubated at 16°C for 2 hours.
After incubation, 3 µl was taken from each reaction tube
to determine total and TCA precipitable counts. Two
microliters of each sample was analyzed by alkaline gel
electrophoresis to assess the quality of second strand
synthesis by the presence of a band of approximately
twice unit length. To the remainder of each sample, 2 µl
of 2.5 µg/µl oyster glycogen, 5 µl of 0.5 M EDTA and 200
µl of 10 mM Tris-HCl pH 7.4, 1 mM EDTA were added, and the
samples were phenol-chloroform extracted and isopropanol
precipitated. The nucleic acids were pelleted by
centrifugation, washed with 80% ethanol and air dried.
The yield of double stranded cDNA in each of the
reactions was approximately 2 µg.

The single-stranded DNA in the hairpin structure 
was clipped using mung bean nuclease. Each second strand 
DNA sample was resuspended in 12 µl of water. Two
microliters of 10x mung bean buffer (0.3 M NaOAc, pH 4.6,
3 M NaCl, 10 mM ZnSO4), 2 µl of 10 mM dithiothreitol, 2 µl
of 50% glycerol, and 2 µl of 10 U/µl mung bean nuclease
(NEB, lot 7) were added to each tube, and the reactions
were incubated at 30°C for 30 minutes. After incubation,
80 µl of 10 mM Tris-HCl pH 7.4, 1 mM EDTA was added to
each tube, and 2 µl of each sample was subjected to
alkaline gel electrophoresis to assess the cleavage of
the second strand product into unit length cDNA. One
hundred microliters of 1 M Tris-HCl pH 7.4 was added to
each sample, and the samples were twice extracted with
phenol-chloroform. Following the final phenol-chloroform
extraction, the DNA was isopropanol precipitated. The
DNA was pelleted by centrifugation, washed with 80%
ethanol and air dried. Approximately 2 µg of DNA was
obtained from each reaction.

The cDNA was blunt-ended with T4 DNA polymerase 
after the cDNA pellets were resuspended in 12 µl of
water. Two microliters of 10x T4 DNA polymerase buffer
(330 mM Tris-acetate, pH 7.9, 670 mM KAc, 100 mM MgAc, 1
mg/ml gelatin), 2 µl of 1 mM dNTP, 2 µl of 50 mM
dithiothreitol, and 2 µl of 1 U/µl T4 DNA polymerase
(Boehringer-Mannheim) were added to each tube. After an
incubation at 15°C for 1 hour, 180 µl of 10 mM Tris-HCl
pH 7.4, 1 mM EDTA was added to each sample, and the
samples were phenol-chloroform extracted followed by
isopropanol precipitation. The cDNA was pelleted by
centrifugation, washed with 80% ethanol and air dried.
Eco RI adapters (Invitrogen, Cat. # N409-20) were ligated
to the blunted cDNA after the DNA from each reaction was
resuspended in 6.5 µl water.

The first strand primer encoded an Sst I cloning 
site to allow the cDNA to be directionally cloned into an 
expression vector. The cDNA was digested with Sst I 
followed by phenol-chloroform extraction and isopropanol 
precipitation. After digestion, the cDNA was 
electrophoresed in a 0.8% low melt agarose gel, and the 
cDNA over 4.2 kb was electroeluted using an Elutrap 
(Schleicher and Schuell, Keene, NH). The electroeluted
cDNA in 500 µl of buffer was isopropanol precipitated and
the cDNA was pelleted by centrifugation. The cDNA pellet
was washed with 80% ethanol.

A cerebellum cDNA library was established by
ligating the cDNA to the Eco RI-Sst I digested, agarose
gel purified pVEGT'.

Ten sublibraries of one million clones each were 
constructed representing a library of ten million 
independent clones. To prepare each sublibrary, 80 ng of 
linearized vector were ligated to 40 ng of cDNA. After 
incubation at room temperature for 11 hours, 2.5 µg of
oyster glycogen and 80 µl of 10 mM Tris-HCl, 1 mM EDTA
were added and the sample was phenol-chloroform extracted
followed by ethanol precipitation. The DNA was pelleted
by centrifugation, and the DNA pellet washed with 80%
ethanol. After air drying, the DNA was resuspended in 3
µl of water. Thirty-seven microliters of
electroporation-competent DH10B cells (BRL) was added to
the DNA and electroporation was completed using a BioRad
electroporation unit. After electroporation, 4 ml of SOC
(Maniatis et al.) was added to the cells, and 400 µl was
spread on each of ten 150 mm LB ampicillin plates. Each
plate represented a sublibrary of 100,000 clones. After
an overnight incubation, the cells were harvested by 
adding 10 ml of LB ampicillin media to each plate and 
scraping the cells into the media. Glycerol stocks and 
plasmid DNA were prepared from each plate. The library 
background (vector without insert) was established at 
about 15%. 

Detection of GluGR activity from the cDNA library

The Xenopus oocyte efficiently translates
exogenously added mRNA. Preliminary experiments were
done using the mouse m1 muscarinic receptor cDNA (a G
protein-coupled receptor that can be detected by
voltage-clamp) cloned into pVEGT'. Injection of RNA
transcribed in vitro from increasing dilutions of the m1
template DNA indicated that m1 agonist induced activity
could be detected for one clone in a pool size of
100,000. A cerebellum sublibrary was plated into ten
pools of 100,000 unique clones.

The pools could also be replica plated onto a
nitrocellulose filter and the original and replica
allowed to grow for a few hours. The original plate is
scraped to harvest all the colonies. Plasmid DNA is
prepared and purified by cesium chloride gradient
ultracentrifugation. The DNA from each pool is
transcribed in vitro with T7 RNA polymerase in the
presence of 7-methyl-G, the capped nucleotide, to
increase translation efficiency. Template DNA
transcription reactions are spiked with a dilution of two
control genes cloned into pVEGT': the mouse m1 gene and a
secreted version of the human placental alkaline
phosphatase gene (SEAP; Tate et al., Fed. Am. Soc. Exp.
Biol. 8: 227-231 (1990), incorporated by reference
herein). Transcription from the control genes would
allow selection of oocytes that more efficiently
translate the injected RNA, and a determination whether
oocytes that are negative for the GluGR are true
negatives, that is, still having a detectable m1
agonist-induced response.

Plasmid DNA prepared from each of the 10 pools of 
100,000 clones, which in total represented one sublibrary 
of one million clones of the cerebellum cDNA library, was 
purified by cesium chloride gradient ultracentrifugation. 
The DNA was transcribed in vitro with T7 RNA polymerase 
(Pharmacia) in the presence of capped nucleotide (GpppG, 
Pharmacia). The presence of a poly(A) sequence and two
T7 RNA polymerase terminators in pVEGT' resulted in RNA
with a capped 5' end, the sequence of the cDNA insert,
and 3' poly(A) tails. Capped RNA is believed necessary
for efficient translation in oocytes (Noma et al., Nature
319:640 (1986)) and the poly(A) sequence has been shown
to increase the synthesis of a protein in oocytes by more
than 40 fold. The transcription reaction tubes were set
up by adding 12 µl of 5x transcription buffer (Stratagene
Cloning Systems, La Jolla, CA), 3 µl each of 10 mM ATP,
CTP, GTP, and UTP, 6 µl of 10 mM GpppG (Pharmacia), 6 µl
of 1 mg/ml BSA, 3 µl of 200 mM DTT, 1.5 µl of 40 U/µl
RNasin (ProMega Biotech, Madison, WI), 8.5 µl of water,
10 µl of cDNA containing 5 to 10 µg DNA, and 1 µl of 70
U/µl T7 RNA polymerase. After mixing, 10 µl of the
reaction was transferred to a tube containing 0.5 µCi of
α-32P-UTP to determine the total counts and counts
incorporated into RNA. The samples were incubated at
37°C for one hour. The cDNA in the unlabeled samples was
degraded with the addition of 1 µl of 200 mM DTT, 2 µl of
30 U/µl DNase I, and 0.5 µl of 40 U/µl RNasin and the
incubation was continued at 37°C for 15 minutes. Forty
microliters of water was added to the radiolabeled
reactions, and 1 µl was removed from each sample and
counted to determine total counts. The remainder of the
labeled samples was ethanol precipitated. The samples
were centrifuged to collect the RNA and the RNA pellets
were counted to determine the counts incorporated into
RNA. After the DNA degradation reaction in the unlabeled
samples, 70 µl of 10 mM Tris-HCl, 1 mM EDTA was added to
each sample, and the samples were twice-extracted with
phenol-chloroform followed by one chloroform extraction.
The RNA was ethanol precipitated. After centrifugation
to collect the RNA, the pellets were washed with 80%
ethanol, followed by air drying for 10 minutes. A
typical yield of the unlabeled RNA was 20 to 30 µg. The
unlabeled RNA was resuspended at 2 µg/µl in
diethylpyrocarbonate (DEPC, Sigma) treated water and
stored at -70°C.
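
As a consistency check on the transcription recipe, the listed component
volumes can be summed and the final working concentrations derived. This
is a minimal sketch, not part of the described method; the dictionary keys
and the helper name are illustrative only.

```python
# Component volumes in microliters, as listed for one transcription reaction.
components_ul = {
    "5x transcription buffer": 12.0,
    "10 mM ATP": 3.0,
    "10 mM CTP": 3.0,
    "10 mM GTP": 3.0,
    "10 mM UTP": 3.0,
    "10 mM GpppG": 6.0,
    "1 mg/ml BSA": 6.0,
    "200 mM DTT": 3.0,
    "RNasin": 1.5,
    "water": 8.5,
    "cDNA template": 10.0,
    "T7 RNA polymerase": 1.0,
}

total_ul = sum(components_ul.values())


def final_concentration(stock: float, volume_ul: float, total: float) -> float:
    """Dilute a stock by volume_ul / total; result is in the stock's units."""
    return stock * volume_ul / total


buffer_x = final_concentration(5.0, components_ul["5x transcription buffer"], total_ul)
ntp_mM = final_concentration(10.0, components_ul["10 mM ATP"], total_ul)
cap_mM = final_concentration(10.0, components_ul["10 mM GpppG"], total_ul)
print(f"total {total_ul} ul; buffer {buffer_x}x; "
      f"each NTP {ntp_mM} mM; GpppG {cap_mM} mM")
```

The volumes add up to 60 µl, which brings the 5x buffer to 1x, each
nucleotide to 0.5 mM and the cap analog to 1 mM.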

Prior to microinjection into oocytes, the RNA 
samples were thawed and centrifuged in a microfuge for 5
minutes to remove any particles that might clog a
microinjection pipet. After centrifugation, 80% of each
sample was removed and split into two tubes.

The RNA from each of the 10 sublibraries was
injected into oocytes as described above and translation
was allowed for four days. Expression of GluGR activity
was assessed by voltage-clamp assay as described above.
One of the 10 sublibraries, Z93-1.9, produced a signal
with administration of quisqualate to the oocyte.

The DNA pool (Z93-1.9) was subdivided by plating
clones from the glycerol stock onto LB ampicillin plates.
To determine the number of clones that should be plated
for the subdivision of the 100,000 clone pool to identify
a positive clone, the probability equation N = ln(1 - P)
/ ln(1 - f) (Maniatis et al., ibid.) was used, where P
is the desired probability of including the clone of
interest, f is the fraction of positive clones in the
pool, and N is the number of clones to be plated to
provide the given probability. For a probability of
99.8% for a pool size of 100,000 to contain one positive
clone, 621,461 clones should be plated.
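
The probability equation can be evaluated directly. The following is a
minimal sketch with a hypothetical function name; it computes N both from
the exact expression and from the small-f approximation ln(1 - f) ≈ -f.
The approximation reproduces the quoted figure of 621,461, and the exact
expression gives a value a few clones lower.

```python
import math


def clones_to_plate(p: float, f: float) -> int:
    """Clones N needed so a clone present at fraction f appears with probability p."""
    if not (0.0 < p < 1.0 and 0.0 < f < 1.0):
        raise ValueError("p and f must lie strictly between 0 and 1")
    return math.ceil(math.log(1.0 - p) / math.log(1.0 - f))


# One positive clone in a pool of 100,000, desired probability 99.8%.
n_exact = clones_to_plate(0.998, 1.0 / 100_000)

# Small-f approximation: ln(1 - f) is close to -f.
n_approx = round(-math.log(1.0 - 0.998) * 100_000)

print(n_exact, n_approx)
```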

Forty-eight 150 mm LB ampicillin plates were
plated with the glycerol stock representing the 100,000
positive pool, Z93-1.9, at a density of approximately
14,000 clones per plate to give a total of 670,000
clones. After an overnight incubation at 37°C, the
bacteria on each plate were harvested into 10 ml of
Solution I (as described by Birnboim and Doly, Nuc. Acids
Res. 7:1513 (1979), incorporated by reference herein). A
glycerol stock was prepared from a portion of the cells,
and plasmid DNA was prepared from the remainder of the
cells. Six pools of DNA representing eight of the LB
ampicillin plates each were prepared by combining one
tenth of the plasmid DNA from groups of eight plates into
each pool. The plasmid DNA from these six pools was
purified by cesium chloride gradient centrifugation. The
DNA was transcribed into RNA as outlined above.
Transcription of the parent pool Z95-1.9 was included as
the positive control. Oocytes were injected with the RNA
and voltage-clamp assays on the oocytes identified pool
Z99-25-32 as positive for GluGR. Pool Z99-25-32 contained
DNA prepared from plates 25 through 32.
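
The bookkeeping for this round of subdivision can be sketched as follows.
The grouping of consecutively numbered plates into pools is an assumption,
suggested by the statement that the positive pool contained plates 25
through 32; the function name is hypothetical.

```python
def group_plates(n_plates: int, per_pool: int) -> list[list[int]]:
    """Split plates numbered 1..n_plates into consecutive pools of per_pool plates."""
    if n_plates % per_pool:
        raise ValueError("plates do not divide evenly into pools")
    return [list(range(start, start + per_pool))
            for start in range(1, n_plates + 1, per_pool)]


pools = group_plates(48, 8)          # six pools of eight plates each
total_clones = 48 * 14_000           # about 670,000 clones plated in total
positive = next(p for p in pools if 25 in p)
print(len(pools), total_clones, positive[0], positive[-1])
```

With 48 plates at roughly 14,000 clones each, the total of 672,000 is
consistent with the approximate figure of 670,000 and exceeds the number
required by the probability equation.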

Plasmid DNA from plates 25 to 32 was cesium
chloride banded and transcribed into RNA as described
above along with the positive parent pool Z99-25-32.
Oocytes were injected with the RNA and voltage clamp
assays, carried out as described above, identified pools
Z104-25 and Z111-32 as being weakly positive, Z106-27 and
Z109-30 as intermediately positive, and Z108-29 and
Z110-31 as the most positive. The pool resulting in
Z110-31 was chosen for further subdivision.

Identification of positive pools from the 
subdivision of the positive pool of 14,000 (Z110-31) from 
the glycerol stock was unsuccessful. Therefore, plasmid
DNA prepared from the pool resulting in Z110-31 was
electroporated into bacteria and plated on 60 plates at a
density of 1,000 clones/plate. Plasmid DNA was prepared
from the bacteria harvested from each plate. Aliquots of
the plasmid DNA from each plate were mixed to make six
pools representing ten plates each. The plasmid DNA was
cesium chloride banded, and the RNA was transcribed as
described above. RNA was transcribed from pools Z108-29,
Z110-31, and a muscarinic receptor cDNA, m1, for use as
positive controls. The RNA was injected into oocytes and
voltage-clamp assays were carried out as described above.
The assays identified pool Z133-21 to 30 as positive.

Plasmid DNA from plates 21 to 30 was cesium
chloride banded and transcribed as described above. The
transcribed RNA and the RNA from the parent pool Z133-21
to 30 were injected into oocytes and assayed as described
above. The voltage-clamp assay identified pool Z142-22
as positive.

Identification of positive pools by the 
subdivision of the positive pool Z142-22 from a glycerol 
stock proved unsuccessful. Restriction analysis of
plasmid DNA prepared from randomly selected clones from
pools Z110-31 (the pool of 14,000) and Z142-22 (the pool
of 1,000) indicated that 50% of pool Z110-31 and 68% of
pool Z142-22 were clones without inserts.

To assess physical methods for enriching for the
GluGR clone and to establish how many clones from pool
Z142-22 needed to be assayed to include a GluGR clone,
undigested plasmid DNA from pool Z142-22 was
electrophoresed on an agarose gel. The super-coil band
representing vector without insert was cut out and the
remainder of the DNA was eluted from the gel. The DNA
was then electroporated into bacteria cells, and plated
at densities of 3,400, 6,900, and 13,800 clones per
plate. The plates were replica plated and grown
overnight. Plasmid DNA was prepared from the cells
harvested from the replica of each plate. The plasmid
DNA was transcribed, and the RNA was assayed in oocytes
as described above. As a control, each pool contained
the equivalent of one colony of m1 as an internal
positive control. In addition, m1 was used as an
external positive control. The voltage-clamp assays 
identified the DNA from the 6,900 clone pool (Z167-7) as 
positive. 

The clones represented on the 6,900 clone plate 
that resulted in the positive pool Z167-7 were subdivided 
by replica plating the master plate onto a Biodyne-A 
nylon membrane on an LB ampicillin plate. The replica 
plate was incubated four hours at 37°C. After
incubation, sub-pools were prepared by removing the
membrane from the plate, taping the membrane to a sterile
glass plate on a light box, and overlaying the membrane
with a grid which divided the membrane into 100 sections.
The sections of the grid and underlying membrane were
then cut out with a razor blade that had been dipped in
alcohol and flamed between each cut. Alcohol-treated,
flamed forceps were used to transfer each membrane
section to a test tube containing 12.5 ml of LB
ampicillin media. The cultures containing the membrane
sections were incubated overnight at 37°C. After
incubation, 0.5 ml of each culture was mixed with 0.5 ml
of 50% glycerol and stored at -70°C to establish glycerol
stocks of each sub-pool. Aliquots of the 100 cultures
were pooled in a 10 x 10 matrix with samples (1) through
(10) on the abscissa and samples (a) through (j) on the
ordinate. For example, 1 ml of cultures (1) through (10)
were added to tube 1 and 1 ml of cultures (1), (11),
(21), (31), (41), (51), (61), (71), (81), and (91) were
added to tube (a) and so on until 10 rows of 10 and 10
columns containing pools of 10 cultures each were
completed. Ten microliters of an overnight culture
containing m1-transformed bacteria was added to each pool
as an internal control. Plasmid DNA was prepared from
the 20 sub-pools, and the DNA was purified by cesium
chloride gradient centrifugation. RNA was transcribed
from the plasmid DNA and was assayed in oocytes as
described above. Positive controls were the parent pool
Z167-7 and pure m1 RNA. The voltage-clamp assays
indicated that only pools Z175-1 and Z191-g were
positive. Consulting the matrix, this indicated that the
membrane section number (7) contained the GluGR clone.
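
The matrix lookup can be illustrated with a short sketch. It assumes the
numbering implied by the example above: numbered tube k holds cultures
10(k-1)+1 through 10k, and lettered tubes (a) through (j) hold every tenth
culture starting at 1 through 10. The extension beyond tube 1 and tube (a)
is inferred from "and so on", and the function names are hypothetical.

```python
import string

GRID = 10  # 10 x 10 matrix of membrane-section cultures


def numbered_tube(k: int) -> set[int]:
    """Cultures pooled into numbered tube k: 10(k-1)+1 through 10k."""
    return set(range(GRID * (k - 1) + 1, GRID * k + 1))


def lettered_tube(letter: str) -> set[int]:
    """Cultures pooled into a lettered tube: a -> 1, 11, ..., 91; b -> 2, 12, ..."""
    offset = string.ascii_lowercase.index(letter) + 1
    return set(range(offset, GRID * GRID + 1, GRID))


def locate(k: int, letter: str) -> int:
    """Return the single culture shared by one numbered and one lettered tube."""
    (culture,) = numbered_tube(k) & lettered_tube(letter)
    return culture


print(locate(1, "g"))  # positive pools Z175-1 and Z191-g
```

Under this numbering, positive tubes 1 and (g) intersect at culture (7).
The same indexing is consistent with the later 10 x 10 matrix in which
row (5) and column (e) identify clone number 45.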

To subdivide the clones contained in section (7) , 
a piece of Biodyne A membrane was applied to the master 
plate containing section (7) , the membrane extending 
beyond section (7) on each side by half the width of 
section (7) . The membrane was removed from the plate, 
applied to a fresh LB ampicillin plate colony side up,
and incubated overnight at 37°C. The membrane was
subdivided as described above with the central region of
the membrane, the actual section (7) area, divided into 9
small, equivalent-sized squares and the membrane on each
side of section (7) was taken as four additional areas.
Each membrane section was used to inoculate a 10 ml
liquid culture. Bacteria transformed with the m1 clone
were used as an internal control in each culture as
described above. After overnight incubation at 37°C,
plasmid DNA was prepared, and the DNA was purified by
cesium chloride gradient centrifugation. RNA was
transcribed and assayed in oocytes as described above
using RNA from m1 and the parent pool number (7) as
positive controls. GluGR activity was found in only pool
Z203-7 corresponding to membrane section number (7).

Pool Z203-7 was subdivided by electroporating the 
plasmid DNA prepared from the membrane section number (7)
into DH10B electroporation-competent cells. The
transformants were plated at a density enabling
individual colonies to be picked. Individual clones
were picked to a master plate and into 2 ml of LB
ampicillin media. The cultures were incubated overnight,
and plasmid DNA was prepared by the method essentially
described by Holmes and Quigley (Anal. Bioc. 114:193
(1981)). Restriction analysis suggested that the clones
were grouped into 7 different classes of clones. Plasmid
DNA prepared from each class, representing fifty total
clones, was transcribed and assayed in oocytes
as described above. However, none of the clones were 
positive. 

To screen for positive clones,
electroporation-competent E. coli DH10B cells were
electroporated with the DNA prepared from membrane
section number (7) (Z203-7) and were plated at 180, 360,
900, and 1800 colonies per plate. The plates were
incubated overnight, and replica plates were prepared as
described above. Plasmid DNA prepared from each replica
plate was combined with 1 to 1000 parts of m1 as an
internal control. The DNA pools, the m1 clone and the
parent pool Z203-7 were transcribed, and the RNA was
assayed by oocyte injection. The first transcription and
injection showed no positives; however, upon
retranscription and reanalysis the 1800 clone pool
(Z264-1800) was positive for GluGR activity.

To subdivide the positive pool of 1800
(Z264-1800), all of the colonies from the plate of 1800,
1528 in total, were each picked to two 100 mm LB
ampicillin agar plates on a 100 colony grid. After
overnight growth, one set of the duplicate plates was
designated as a master set and was placed at 4°C. The
other set was replica plated to a third set of plates.
After overnight incubation of these plates, the cells on
the replica plates were harvested into media and plasmid
DNA was prepared from the pooled cells. As described
above, an internal m1 control was included in each DNA
preparation. m1 DNA and the parent Z264-1800 DNA were
used as external positive controls. Plasmid DNA prepared
from the 16 plates was transcribed, and the RNA was
assayed in oocytes as described above. One of the pools
of 100 clones, Z256-1, produced GluGR activity.

To identify which clone of the 100 clones from 
Z256-1 produced the GluGR activity, a 10 x 10 matrix of
the clones was constructed. A liquid culture of each
clone was grown. One milliliter of each culture was
added to each of two tubes representing the appropriate
row and column of the 10 x 10 matrix. As described
previously, plasmid DNA encoding m1 was used as an
internal positive control. Plasmid DNA prepared from
each tube, m1 DNA and DNA from the parent pool Z264-1800
were transcribed and assayed in oocytes as described
above. GluGR activity was identified only in row (5) and
column (e). Thus, the positive clone number 45 was
identified as containing the GluGR activity.

To confirm the result, plasmid DNA from clone #45 
was prepared, transcribed and assayed in oocytes as 
described above. The results of the assay indicated that 
clone #45 was capable of producing GluGR activity. Figure
2 illustrates the data taken from voltage-clamp 
recordings at several stages in the subfractionation of 
the cerebellum library. Panel (a) is a recorded response 
to quisqualate of an oocyte previously injected with in 
vitro transcribed RNA from a rat cerebellum sublibrary of 
100,000 independent colonies; panel (b) shows the 
response to quisqualate in a cell previously injected 
with RNA transcribed from a subfractionated pool of 
14,000 colonies. The peak current was truncated by the 
chart recorder, but the actual peak current (estimated 
from a digital panel meter) was approximately 1300 nA. 
Panel (c) shows the response to quisqualate in a cell 
injected with pure GluGR RNA from clone 45-A. The amount
of RNA injected per oocyte was approximately 100 ng,
except in panel (c) where the amount of RNA was 50 pg.

The following describes an alternative means for
subdividing and screening a positive pool. Working with
cDNA inserts in a plasmid based rather than a
lambda-based vector influences the subfractionation
protocol. Once a positive pool is identified, the replica
filter is overlayed with another sterile nitrocellulose
filter. The filter is cut into 88 pieces by using evenly
spaced cuts of 10 rows and 10 columns to form a grid.
Each of the 88 pieces is transferred to 10 ml of sterile
LB + Amp and grown for several hours. Twenty pools are
formed: C 1-10 (corresponding to column number) and R
1-10 (corresponding to row number). An aliquot of each
of the 88 subfractions is pipetted into 2 tubes,
corresponding to its position in a row and a column. DNA
is isolated from the 20 pools, purified on CsCl gradients
and transcribed in an in vitro reaction that includes the
control m1 and SEAP plasmids. After injection into
oocytes and voltage-clamp recording there are 2 positive
pools, pinpointing the location of 1 of the 88 original
subfractions.

Because the positive clone is still part of a pool
it must be further subdivided. The probability equation
described above is used to determine the number of clones
to be plated for the next subdivision of the pool. The
glycerol stock from the positive pool is plated out at,
e.g., 3000, 6000 and 18,000 clones per plate. After
replica plating the DNA is harvested, transcribed,
injected and assayed. The pool which is positive is
subdivided into a grid of 88 as described above. The
assay is repeated, and a single square of the grid is
positive. At the next step of subdivision of the pool,
100 individual colonies to a plate are picked, replica
plated, and 20 pools are made for transcription and
assay. Positive clones are streaked out, several 
colonies picked and restriction mapped and template and 
transcript prepared for injection and assay.

To establish that the GluGR encoded by clone 45-A
couples to G-protein, clone 45-A GluGR RNA was transcribed
and injected into oocytes as described above. Two days
after injection the oocytes were divided into control and
toxin-treated groups. The oocytes in the toxin-treated
group were treated with a final concentration of 4 µg/ml
of B. pertussis toxin (List Biological Laboratories Inc.,
Campbell, CA), and both groups were incubated for 24
hours at 19°C as described by Sugiyama et al., Nature
325:531 (1987) and Moriarity et al., J. Biol. Chem.
264:13521 (1989), both of which are incorporated by
reference herein. The oocytes from both the control and
toxin-treated groups were subjected to voltage-clamp
assays as described previously. In one example, oocytes
perfused as described previously with 100 µM L-glutamic
acid showed a mean L-glutamic acid-induced current of
264.2 nA +/- 73 nA in control oocytes (SEM, n=6) and 57.7
nA +/- 19 nA (n=9) in toxin-treated oocytes. The mean
membrane current in the toxin-treated group was
significantly smaller (p < 0.01) than in the control
group, suggesting that the receptor expressed in oocytes
injected with 45-A RNA coupled to a pertussis
toxin-sensitive G protein.
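
The size of the toxin effect can be expressed as a fraction of the control
response. The sketch below uses only the two reported mean currents; the
variable and function names are illustrative, and no statistical test is
reproduced here.

```python
control_mean_nA = 264.2   # mean L-glutamic acid-induced current, control oocytes
toxin_mean_nA = 57.7      # mean current, pertussis toxin-treated oocytes


def percent_of_control(treated: float, control: float) -> float:
    """Express a treated response as a percentage of the control response."""
    if control == 0:
        raise ValueError("control response must be non-zero")
    return 100.0 * treated / control


remaining = percent_of_control(toxin_mean_nA, control_mean_nA)
inhibition = 100.0 - remaining
print(f"{remaining:.1f}% of control remains; {inhibition:.1f}% inhibition")
```

On these figures the toxin-treated oocytes retain roughly one fifth of
the control current.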

L-glutamic acid and some of its structural 
derivatives that are known to activate GluGR currents in a
dose-dependent manner were applied to oocytes that had
been injected with RNA transcribed from the 45-A clone.
RNA was transcribed and oocytes were prepared and
injected as previously described. Dose dependent
responses were measured using voltage clamp assays
carried out in the presence of increasing concentrations
of L-glutamic acid (Sigma), quisqualic acid (Sigma),
ibotenic acid (Sigma), or trans 1-amino-cyclopentyl-1,3
dicarboxylic acid (tACPD; Tocris Neuramin, Essex,
England). Four or five separate oocytes were perfused
with increasing concentrations of a particular drug with
30 minutes between consecutive applications of the drug
to minimize any interference from desensitization. The
responses were normalized to a subsequent response to 100
µM L-glutamic acid. The data were analyzed using the
following equation:

$$\text{Fractional current} = \frac{\text{Dose}^{n}}{\text{Dose}^{n} + \mathrm{EC}_{50}^{\,n}}$$

where:

Dose = a dose of drug normalized to that evoked by
a subsequent application of 100 µM L-glutamic acid;

Fractional current = the peak current evoked by a
dose, as defined above;

EC50 = effective concentration that evokes a 50%
response (a measure of the potency of an agonist); and

n = the Hill coefficient, a measure of the
cooperativity of the reaction.

Using this equation, the effective concentration
at 50% stimulation relative to 100 µM L-glutamic acid was
determined for each dose response experiment. Figure 6
shows a representative dose response curve for varying
concentrations of L-glutamic acid. The potency series of
glutamate analogs and their associated EC50's are listed
in Table 2.

Table 2

Glutamate analog      Potencies (EC50)
Quisqualic acid       0.681 µM
L-glutamic acid       12.32 µM
Ibotenic acid         32.37 µM
tACPD                 376 µM
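
The dose-response relation can be sketched in code using the standard
Hill form, in which the EC50 term carries the same exponent as the dose.
The Hill coefficient is not reported in this passage, so the default of
n = 1 below is purely an assumption for illustration, and the function
and dictionary names are hypothetical. The EC50 values are those of
Table 2.

```python
def fractional_current(dose_uM: float, ec50_uM: float, hill_n: float = 1.0) -> float:
    """Hill equation: dose^n / (dose^n + EC50^n)."""
    if dose_uM < 0 or ec50_uM <= 0 or hill_n <= 0:
        raise ValueError("dose must be >= 0; EC50 and Hill coefficient must be > 0")
    d = dose_uM ** hill_n
    return d / (d + ec50_uM ** hill_n)


# EC50 values (micromolar) from Table 2.
EC50_UM = {
    "quisqualic acid": 0.681,
    "L-glutamic acid": 12.32,
    "ibotenic acid": 32.37,
    "tACPD": 376.0,
}

# By definition, a dose equal to the EC50 evokes half of the maximal response.
half = fractional_current(EC50_UM["L-glutamic acid"], EC50_UM["L-glutamic acid"])

# Potency of each agonist relative to L-glutamic acid (ratio of EC50 values).
relative_potency = {name: EC50_UM["L-glutamic acid"] / ec50
                    for name, ec50 in EC50_UM.items()}
print(half, {k: round(v, 2) for k, v in relative_potency.items()})
```

A lower EC50 means a more potent agonist, so the ratios give the rank
order quisqualic acid > L-glutamic acid > ibotenic acid > tACPD.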

In addition, oocytes were exposed to the following
L-glutamic acid analogs: aspartic acid (Tocris Neuramin);
kainic acid, N-methyl-D-aspartic acid (NMDA; Sigma),
2-amino-4-phosphonobutyric acid (APB; Sigma),
α-amino-3-hydroxy-5-methyl-isoxazole-4-propionic acid
(AMPA; Research Biochemicals Inc., Wayland, MA) at
saturating concentrations and the responses were each
normalized to a subsequent response to 100 µM
L-glutamate. The L-glutamic acid analogs that were found
to be ineffective were 1 mM aspartic acid, 1 mM kainic
acid, 100 µM NMDA + 10 µM glycine, 100 mM APB and 100 µM
AMPA.

Voltage clamp assays were also carried out on 
injected oocytes to measure the inhibition by the 
putative glutamate G protein-coupled receptor antagonist, 
2-amino-3-phosphonopropionic acid (AP3). Voltage clamp 
assays showed that at 1 mM, DL-AP3 (Sigma) reduced the 
current evoked by 10 µM glutamic acid to 59.3 +/- 7.3% of
the control. 

Clone 45 cells were streaked out on LB Amp plates
and several colonies were picked, grown up and the DNA
isolated. Pure 45-A DNA was prepared and restriction
mapped by standard procedures. Clone 45-A has been
deposited with the American Type Culture Collection,
12301 Parklawn Drive, Rockville, MD, 20852, under ATCC
Accession No. 68497. DNA was digested with single or
multiple enzymes. The fragments were separated on both
1% agarose and 4% NuSieve gels by electrophoresis. After
electrophoresis the DNA was transferred to nitrocellulose
filters using standard protocols for Southern transfer.
Restriction sites were mapped based on size and based on
hybridization to Pst I subclones of 45-A DNA.
Additionally, the entire 45-A cDNA insert can be isolated
by digestion with Not I restriction endonuclease. The
Not I insert was kinased with γ-32P ATP, and after
digestion of half of the sample with Bam HI to remove the
3' label, both samples were subjected to digestion with a
number of enzymes known to be present once in the insert.
In this way the unique sites could be localized. A
restriction map of GluGR clone 45-A is shown in Figure 3.

The entire 45-A clone was sequenced in both 
directions using the dideoxynucleotide chain termination 
method (Sanger and Coulson, J. Mol. Biol. 94:441 (1975), 
incorporated herein by reference). Figure 5 (Sequence ID
Nos. 1 and 2) shows the DNA sequence and deduced amino
acid sequence of clone 45-A. Figure 5 also shows the
location of putative N-linked glycosylation sites, which
have been predicted to occur at the amino acid sequence
Asn-X-Thr.

As shown in Figure 5, seven putative transmembrane
domains have been predicted from the deduced amino acid
sequence of clone 45-A using the method described by
Eisenberg et al., J. Mol. Biol. 179:125-142 (1984),
incorporated herein by reference. Only those predicted
to be transmembrane multimeric domains were included. An
additional transmembrane domain (the third) was predicted
using the method of Hopp and Woods, Proc. Natl. Acad.
Sci. USA 78:3824-3838 (1981). Based on these
predictions, the protein encoded by clone 45-A appears to
have two unusually large domains on the amino- and
carboxy-termini that are not found in any of the other
reported G protein-coupled receptors which have the
common structural feature of seven predicted membrane
spanning regions. Analysis of the deduced amino acid
sequence of clone 45-A predicts three other hydrophobic
stretches including one at the amino-terminus of the
sequence. This amino-terminal hydrophobic stretch may be
a signal sequence, although no signal cleavage site is
predicted downstream of the sequence.

Poly(A)+ RNA was isolated from total rat brain and
rat cerebellum using oligo d(T) cellulose chromatography
as described by Aviv and Leder (ibid.). Poly(A)+ RNA
from rat retina, rat heart, rat lung, rat liver, rat
kidney, rat spleen, rat testis, rat ovary and rat
pancreas were purchased from Clontech. The poly(A)+ RNA
samples were analyzed by northern analysis (Thomas, Proc.
Natl. Acad. Sci. USA 77:5201-5205 (1980), which is
incorporated by reference herein). The RNA was denatured
in glyoxal, electrophoresed in agarose and transferred to
a nitrocellulose membrane essentially as described by
Thomas (ibid.). The northern blot was hybridized with a
radiolabeled 3473 bp Eco RI-Xba I fragment from the 45-A
clone. Autoradiography of the blot showed hybridization
to a major band of approximately 7 kb and a smaller band
of approximately 3.8 kb in the total rat brain and rat
cerebellum RNA.

Single-stranded cDNA was synthesized using 1 µg of
the poly(A)+ RNA using Superscript reverse transcriptase
(BRL) under conditions described by the manufacturer.
One fourth of the cDNA was used as a template for PCR
amplification using 40 pmoles each of the GluGR-specific
primers ZC3652 (Table 1; Sequence ID Number 14) and
ZC3654 (Table 1; Sequence ID Number 15) and 2.5 U Taq I
polymerase (Perkin Elmer Cetus, Norwalk, CT) and
conditions specified by the manufacturer. As an internal
control, the PCR reaction also contained 2 pmoles each of
the glucose-6-phosphate dehydrogenase-specific primers
ZC3015 (Table 1; Sequence ID Number 12) and ZC3016 (Table
1; Sequence ID Number 13). After thirty cycles (one
minute at 94°C, one minute at 60°C, ninety seconds at
72°C), the samples were phenol-chloroform extracted and
20% of each reaction was electrophoresed in agarose. The
DNA was bidirectionally transferred to nitrocellulose
membranes, and the filters were hybridized with either
radiolabeled ZC3652, ZC3654, ZC3015 and ZC3016 (Sequence
ID Nos. 14, 15, 12 and 13, respectively) or with the
radiolabeled Eco RI-Xba I fragment of clone 45-A
described above. Autoradiography of the hybridized blot
showed that the GluGR transcript was mainly confined to
total rat brain and rat cerebellum; however, longer
exposures showed a GluGR-specific transcript in both
retina and testis.

Total RNA was prepared, as described above, from
specific rat brain regions including frontal cortex,
cerebellum, hippocampus, cortex, striatum, pons medulla,
and the remainder of the brain. Single-stranded cDNA was
synthesized as described previously using 20 µg of total
RNA in 50 µl using Superscript reverse transcriptase
(BRL) under conditions described by the manufacturer.
After a one hour incubation at 42°C, the samples were
treated with RNAse (Boehringer Mannheim Biochemicals,
Indianapolis, IN), phenol-chloroform extracted, and
ethanol precipitated. The samples were resuspended in
water and half of each sample was subjected to PCR
amplification. Each PCR amplification contained 40
pmoles of each of the GluGR-specific primers ZC3652 and
ZC3654 described above (Sequence ID Numbers 14 and 15), 2
pmoles of each of the glucose-6-phosphate
dehydrogenase-specific primers ZC3015 and ZC3016
(Sequence ID Nos. 12 and 13) and 2.5 U Taq I polymerase
(Perkin Elmer Cetus) and conditions described by the
manufacturer. After 35 cycles (one minute at 94°C, one
minute at 60°C, ninety seconds at 72°C), the samples were
phenol-chloroform extracted, and 20% of each reaction was
electrophoresed in agarose. The DNA was transferred to a
nitrocellulose membrane, and the filter was hybridized
with the radiolabeled Eco RI-Xba I fragment of clone 45-A
described above. Autoradiography of the hybridized blots
showed a broad distribution of the GluGR transcript
throughout the brain, although the frontal cortex and
cerebellum appear to be somewhat enriched.

Southern analysis of rat and human genomic DNA was
carried out using essentially the method described by
Blin et al. (Nucl. Acids Res. 3:2303 (1976), which is
incorporated by reference herein). Briefly, rat and
human genomic DNA was prepared from the rat cell line UMR
106 (ATCC CRL 1661) and a human hepatoma cell line (ATCC
HTB 52), respectively. The genomic DNA was digested with
either Eco RI or Pst I, and electrophoresed through
agarose. The DNA was transferred to a nitrocellulose
membrane, and the membrane was hybridized with a
radiolabeled 1.6 kb Pst I fragment from clone 45-A.
Autoradiography of the hybridized blot suggests that the
human gene has a similar sequence to the rat GluGR
sequence, that the GluGR gene contains at least one
intron, and that there are a small number of closely
related genes.

Expression in Mammalian Cells

The entire GluGR cDNA insert was removed from the
pVEGT' cloning vector by digestion with Not I and Xba I.
The ends were blunted with DNA polymerase I (Klenow
fragment) and dNTPs, and were then ligated with Eco RI
(Smart) linkers. After linker ligation, the insert with
Eco RI ends was kinased and ligated to Eco RI-cut and
capped Zem228 expression vector. Bacteria were
transformed with the ligation reaction and clones were
characterized by restriction analysis and partial
sequencing (see Fig. 4).

Cultured mammalian cells, such as BHK 570 and BHK
ts13, served as host cells for expression. Twenty-five µg
of CsCl-purified DNA was precipitated with calcium
phosphate and added to tissue culture cells in a 150 mm
plate. After 4 hours the cells were subjected to a
glycerol shock and were then put into non-selective
medium. In some cases it may be necessary to include an
antagonist to the GluGR in the medium to prevent
expression of a cytotoxic response in those cells where
the GluGR is expressed at levels high enough to cause a
certain amount of autoactivation. To assay transiently
expressed GluGR ligand binding activity or PLC activation,
cells are harvested after 48 hours. Stable expression was
detected after 2 weeks of selection. The Zem228 expression
vector includes a promoter capable of directing the
transcription of the GluGR gene, and a selectable marker,
the bacterial neomycin resistance gene. Resistance
to the drug G-418, an inhibitor of protein synthesis, was
used to identify stably transfected clones. Presence of
the SV40 ori region on the vector allows the expression
construct to also be used for transient expression.
In some instances it was preferable to include DNA for
another selectable marker, the DHFR gene, in the
transfection protocol. Selection with both G-418 and
methotrexate allowed isolation of clones whose expression
of GluGR can be subsequently amplified by the addition of
increasingly higher concentrations of methotrexate to the
culture medium.

Transfected cell lines expressing GluGR were
identified by the binding of 3H-glutamate to membrane
preparations from transfected cells. Cell lines
expressing low to moderate levels of GluGR are used to set
up functional screening assays.

Clones of BHK 570 and BHK TK- ts13 cells expressing
the rat G protein-coupled glutamate receptor cDNA were
plated in two or three 150 mm maxi-plate culture dishes
and were grown to confluency. The cells from each plate
were scraped in 5 ml of PBS (phosphate buffered saline,
Sigma Chemical Co., St. Louis, MO), which was
pre-chilled to 4°C. The cells were removed to a chilled
centrifuge tube, and the plates were each rinsed with 5
ml of chilled PBS and pooled with the cells. The chilled
tubes were spun at 1,000 rpm for two minutes, and the
supernatant was discarded. The cells were frozen at
either -70°C or on dry ice. In some cases, the cells
were left overnight at -70°C. The cells were thawed on
ice and were resuspended in 10 ml of a buffer containing
30 mM Tris, pH 7.0, 2.5 mM CaCl2, 1 mM PMSF, which was
pre-chilled to 4°C, by homogenizing the cells for about
15 seconds. The suspension was poured into chilled
centrifuge tubes. The homogenizer was rinsed with 10 ml
of the same chilled solution, and the rinse was combined
with the suspension. The centrifuge tubes were spun for
fifteen minutes at 40,000 x g at 4°C, and the supernatant
was discarded. The pellet was homogenized with a buffer
containing 30 mM Tris, pH 7.0, 2.5 mM CaCl2, which was
pre-chilled to 4°C. The homogenizer was rinsed with the
chilled buffer, and the rinse was combined with the
homogenate. The homogenate was spun as described above.
The second homogenization was repeated on the resulting
pellet. The final pellet was resuspended in between two
and five milliliters of 30 mM Tris, pH 7.0, 2.5 mM CaCl2,
which was pre-chilled to 4°C. Triplicate samples were
prepared for each plus and minus quisqualate assay point
such that 250 µl aliquots of each homogenate sample were
added to the wells of a 96-well microtiter plate. To a
buffer containing 30 mM Tris, pH 7.0, 2.5 mM CaCl2, which
was pre-chilled to 4°C, a final concentration of 10 nM
tritiated glutamic acid was added, and the solution was
split in half. To one half, quisqualate was added to a
final concentration of 1 mM. Two hundred and fifty
microliter aliquots of either 30 mM Tris, pH 7.0, 2.5 mM
CaCl2, 5 nM tritiated glutamic acid and 500 µM
quisqualate, or 30 mM Tris, pH 7.0, 2.5 mM CaCl2, 5 nM
tritiated glutamic acid were added to the triplicate
samples. The samples were incubated for thirty minutes
at room temperature. The samples were harvested onto
glass filters and were immediately washed with ice-cold
30 mM Tris, pH 7.0, 2.5 mM CaCl2 under vacuum using an LKB
1295-001 automated cell harvester (Pharmacia LKB,
Piscataway, NJ). The filters were dried in a microwave
oven and counted in a gamma counter.
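
The arithmetic behind the plus and minus quisqualate assay points can be sketched as follows. This is a hedged illustration: the mixing calculation assumes that a 250 µl ligand aliquot is combined with a 250 µl homogenate aliquot, and the triplicate counts are hypothetical numbers invented for the example, not data from these experiments.

```python
from statistics import mean


def final_concentration(stock_conc, stock_volume_ul, sample_volume_ul):
    """Concentration of a component after a stock aliquot is mixed with a sample aliquot."""
    return stock_conc * stock_volume_ul / (stock_volume_ul + sample_volume_ul)


def specific_binding(minus_competitor_counts, plus_competitor_counts):
    """Mean counts without competitor minus mean counts with excess unlabeled competitor."""
    return mean(minus_competitor_counts) - mean(plus_competitor_counts)


# Equal-volume mixing halves each concentration in the ligand solution.
glutamate_nM = final_concentration(10.0, 250.0, 250.0)   # from a 10 nM solution
quisqualate_mM = final_concentration(1.0, 250.0, 250.0)  # from a 1 mM solution

# Hypothetical triplicate counts, for illustration only.
minus_quisqualate = [1520.0, 1480.0, 1500.0]
plus_quisqualate = [310.0, 290.0, 300.0]

if __name__ == "__main__":
    print(glutamate_nM, quisqualate_mM)
    print(specific_binding(minus_quisqualate, plus_quisqualate))
```

Counts that remain in the presence of excess quisqualate are treated here as nonspecific, so the difference between the two means estimates the binding that quisqualate can displace.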

Protein determinations were carried out using a
Coomassie Blue-based assay from Pierce Chemical Company
(Rockford, IL) under conditions set forth by the
manufacturer. One hundred microliters of undiluted cell
homogenate or BSA standard was added to 2 ml of reagent
and the optical density was measured at 595 nm. Protein
concentrations of the samples were taken from a standard
curve generated using the BSA standards diluted in 30 mM
Tris, pH 7.0, 2.5 mM CaCl2.
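
Reading a sample concentration from a BSA standard curve can be sketched as below. The standards and optical densities are hypothetical, and a straight-line fit is an assumption made for simplicity; dye-binding assays are often linear only over part of their range, and the manufacturer's instructions govern the actual procedure.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def protein_from_od(od, slope, intercept):
    """Interpolate the protein amount that corresponds to a measured optical density."""
    return (od - intercept) / slope


# Hypothetical BSA standards (µg per assay) and their OD595 readings.
bsa_ug = [0.0, 5.0, 10.0, 20.0]
od595 = [0.05, 0.15, 0.25, 0.45]

slope, intercept = fit_line(bsa_ug, od595)

if __name__ == "__main__":
    # A hypothetical sample reading of 0.35 falls between the 10 and 20 µg standards.
    print(round(protein_from_od(0.35, slope, intercept), 2))
```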

The results of these assays showed that 
quisqualate was able to competitively bind the glutamate 
receptor expressed by the transfected BHK cells. 

Functional screening of agonists and antagonists

BHK 570 cells expressing GluGR or mock-transfected BHK
570 cells are plated into 24-well tissue culture dishes
at about 100,000 cells per well. After 24 hours, the
cells are labeled with 0.2 µCi of myo-(2-3H) inositol
(specific activity = 20 Ci/mmol; New England Nuclear)
per well. At the end of a 24 to 48 hour incubation, the
cells are washed with prewarmed DMEM (Dulbecco's Modified
Eagle's Medium; Product No. 51-432, JRH Biosciences,
Lenexa, KS) which has been buffered to pH 7.4 with Hepes
buffer (Sigma Chemical Co.) containing 10 mM LiCl, and
are incubated for five minutes at 37°C. The selected drugs
are then added and the cells are incubated for an
additional thirty minutes at 37°C. The reaction is
stopped by placing the cells on ice, and the cells are
lysed by aspirating off the media and adding 0.5 ml of
cold DMEM and 0.5 ml of ice-cold 10% perchloric acid.
After ten minutes the cell lysate is transferred to a
tube on ice containing 250 µl 10 mM EDTA, pH 7.0. The
samples are neutralized with 325 µl of 1.5 M KOH in 60 mM
Hepes Buffer. After the precipitate settles, 1.0 ml of
the supernatant is applied to an Amprep minicolumn
(Amersham, Arlington Heights, IL, RPN1908). Inositol
phosphates are eluted off the column and samples are
counted in a scintillation counter. A positive response
is indicated by an increase in labeled inositol phosphate
levels.



Screening for additional glutamate receptor subtypes

Additional glutamate receptor subtypes were
isolated using probes derived from clone 45-A. Glutamate
receptor subtypes were isolated from a total rat brain
cDNA library in Lambda Zap II, which was size selected
for inserts of 3 kb before ligation (prepared for Terry
Snutch, Ph.D., University of British Columbia, Vancouver,
British Columbia, Canada by Stratagene Cloning Systems,
La Jolla, CA) and a rat cerebellum cDNA library in Lambda
Zap II, which was size selected for inserts of 3 kb
before ligation (Stratagene Cloning Systems, La Jolla,
CA).

The total rat brain library and the rat cerebellum
library were plated out with E. coli XL-1 cells onto NZY
agar plates (Table 3) to obtain approximately 2.1 x 10
plaques. Clone 45-A, encoding subtype 1a, was digested
with Pst I to isolate the 1.3 and 1.6 kb fragments. The
45-A Pst I fragments were labeled by random priming using
the Amersham random-priming kit (Amersham, Arlington Hts,
IL). Duplicate lifts were prepared from the plates, and
the filters were hybridized with the probes in 50%
formamide at 37°C. After an overnight hybridization, the
filters were washed in 2x SSC + 0.1% SDS at 50°C.
Positive plaques were isolated by several rounds of
dilution plating and repeated screening with the
random-primed probes.

Table 3 

NZY Agar

To 950 ml of deionized water, add:

10 g NZ amine: Casein hydrolysate enzymatic (ICN
Biochemicals)

5 g NaCl

5 g bacto-yeast extract

1 g casamino acids

2 g MgSO4·7H2O

Shake until the solutes have dissolved. Adjust to
pH 7.0 with 5 N NaOH (approximately 0.2 ml). Adjust the
volume of the solution to 1 liter with deionized H2O.
Sterilize by autoclaving for 20 minutes.

20x SSC

Dissolve 175.3 g NaCl and 88.2 g sodium citrate in
800 ml H2O. Adjust the pH to 7.0 with a few drops of 10 N
NaOH. Adjust the volume to 1 liter with H2O. Sterilize
by autoclaving.
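
As a quick check on the SSC stock recipe above, the molar concentrations and the dilution needed for a 2x SSC wash can be computed as follows. The molecular weights are standard values, but the use of the dihydrate form of sodium citrate is an assumption (the recipe does not name the salt form), and the 500 ml wash volume is hypothetical.

```python
NACL_MW = 58.44                       # g/mol
SODIUM_CITRATE_DIHYDRATE_MW = 294.10  # g/mol; assumes the dihydrate salt was weighed


def molarity(grams, molecular_weight, liters):
    """Moles of solute per liter of final solution."""
    return grams / molecular_weight / liters


def stock_volume_for_dilution(stock_x, target_x, final_volume_ml):
    """Volume of concentrated stock needed to prepare a more dilute working solution."""
    return final_volume_ml * target_x / stock_x


nacl_molar = molarity(175.3, NACL_MW, 1.0)
citrate_molar = molarity(88.2, SODIUM_CITRATE_DIHYDRATE_MW, 1.0)

# Stock required for a hypothetical 500 ml of 2x SSC wash solution.
stock_ml = stock_volume_for_dilution(20.0, 2.0, 500.0)

if __name__ == "__main__":
    print(round(nacl_molar, 2), round(citrate_molar, 2), stock_ml)
```

Under these assumptions the stock works out to roughly 3 M NaCl and 0.3 M sodium citrate, which matches the conventional composition of a 20x SSC stock.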

Plasmid DNA was prepared from positive plaques
using the Bluescript system (Stratagene Cloning Systems).
The plasmid DNA was subjected to restriction analysis and
Southern blot analysis (Sambrook et al., ibid., which is
incorporated herein by reference). Two clones, SN23,
derived from the total rat brain library, and SR2,
derived from the rat cerebellum library, were identified
as being different from the 45-A clone and were
sequenced. Sequence analysis showed that they
represented two additional subtypes. SN23 encodes
subtype 1b, which contains an additional 85 bp exon that
encodes a new stretch of 20 amino acids and a stop codon
in the intracellular domain, and is 292 amino acids
shorter than the 45-A clone. The nucleotide sequence and
deduced amino acid sequence of clone SN23 are shown in
Fig. 7. SR2 was found to contain a partial cDNA sequence
encoding subtype 2a, which is a novel sequence that shares
42% homology with the transmembrane domains and
extracellular domain of the 45-A clone.

A complete subtype 2a clone was obtained by
rescreening both libraries as described above with the
radiolabeled 1.3 kb Pst I fragment from clone 45-A and a
radiolabeled 1.4 kb Eco RI-Pvu II fragment from SR2. Two
additional clones were obtained. SN30, derived from the
total rat brain library, contained the entire subtype 2a
coding sequence. The nucleotide sequence and deduced
amino acid sequence of clone SN30 are shown in Fig. 8.
SR13, derived from the rat cerebellum library, contained
an incomplete sequence of a new receptor subtype, 2b.
Sequence analysis of SR13 showed that the coding sequence
was incomplete at the 3' end and was virtually identical
to the SN30 sequence except that it contained a 610 base
pair deletion within the 3' terminus of SN30. The DNA
sequence of the cDNA insert in clone SR13 is shown in
Figure 9.

The complete 3' end of the subtype 2a clone was
generated using PCR amplification and an oligonucleotide
containing a sequence unique to SR13 (ZC4520, Table 4)
and an oligonucleotide corresponding to a sequence near
the 3' end of the 3' non-translated region of SN30
(ZC4519, Table 4). DNA was prepared from plate lysates
of the original plating of each library. Each plate
produced a pool of clones. For the PCR reactions, ten
nanograms from each library and 100 pmol of each
oligonucleotide were combined in a reaction volume of 50
µl containing 50 mM KCl, 10 mM Tris-HCl pH 9.0, 1.5 mM
MgCl2, 0.1% Triton X-100, 0.01% gelatin, 0.2 mM each
deoxynucleotide triphosphate and 2.5 units of Thermus
aquaticus (Taq) DNA polymerase (Promega Corporation,
Madison, WI). The reaction mixture was overlaid with
mineral oil. After five cycles (30 seconds at 94°C, 30
seconds at 45°C, 1 minute at 50°C) and twenty-five cycles
(30 seconds at 94°C, 30 seconds at 45°C, 1 minute at
72°C) the amplified DNA was removed for analysis.
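
Two quantities implied by this reaction setup are easy to work out: the primer concentration in the reaction and the total time spent at the programmed temperatures. The sketch below uses the amounts and step times stated above; the function names are illustrative, and the time estimate deliberately ignores ramping between temperatures, which depends on the thermal cycler.

```python
def primer_concentration_uM(pmol, reaction_volume_ul):
    """pmol per microliter is numerically equal to micromolar."""
    return pmol / reaction_volume_ul


def program_seconds(stages):
    """Total hold time for a list of (cycle_count, [step_seconds, ...]) stages."""
    return sum(cycles * sum(steps) for cycles, steps in stages)


# 100 pmol of each oligonucleotide in a 50 µl reaction.
primer_uM = primer_concentration_uM(100.0, 50.0)

# Five cycles followed by twenty-five cycles, each step given in seconds.
library_pcr = [(5, [30, 30, 60]), (25, [30, 30, 60])]
total_seconds = program_seconds(library_pcr)

if __name__ == "__main__":
    print(primer_uM, total_seconds / 60)
```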

Table 4 

Degenerate Oligonucleotide Primer Sequences (5' - 3')

ZC4519
TTT ATT AGA AAT GTT GTC GGT

ZC4520
CCT CTT CCA TAT TTT TCC ATT

ZC4559
ATA AGA ATT CAT NKR YTT NGC YTC RTT RAA

ZC4560
ATA AGA ATT CTT YRA YGA RAA NGG NGA YGC

ZC4561
ATA AGA ATT CGC NGG NAT HTT YYT NKG NTA

ZC4562
ATA AGA ATT CTA NCM NAR RAA DAT NCC NGC

ZC4563
ATA AGA AAT CAN GTN GTR TAC ATN GTR AA
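
The size of each degenerate primer family in Table 4 follows from the standard IUPAC ambiguity codes (for example R = A or G, Y = C or T, N = any base). The sketch below counts the distinct sequences represented by a primer; the function name is illustrative and the code is not part of the described procedure.

```python
# Number of bases each IUPAC nucleotide code can stand for.
IUPAC_CHOICES = {
    "A": 1, "C": 1, "G": 1, "T": 1,
    "R": 2, "Y": 2, "K": 2, "M": 2, "S": 2, "W": 2,
    "B": 3, "D": 3, "H": 3, "V": 3,
    "N": 4,
}


def degeneracy(primer):
    """Count the distinct sequences encoded by a primer written with IUPAC codes."""
    total = 1
    for base in primer.replace(" ", "").upper():
        total *= IUPAC_CHOICES[base]
    return total


ZC4519 = "TTT ATT AGA AAT GTT GTC GGT"
ZC4560 = "ATA AGA ATT CTT YRA YGA RAA NGG NGA YGC"

if __name__ == "__main__":
    print("ZC4519:", degeneracy(ZC4519))
    print("ZC4560:", degeneracy(ZC4560))
```

ZC4560 contains five two-fold positions and two four-fold positions, so the pool represents 2^5 x 4^2 = 512 sequences, while the non-degenerate ZC4519 represents a single sequence.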

An aliquot from each reaction was electrophoresed
on agarose and transferred to nitrocellulose for Southern
analysis. Southern analysis of the PCR products showed
that a 460 bp fragment corresponding to the 3' end of the
2b sequence was present in several pools. One of the
pools that produced the correct size PCR product encoding
the 3' sequence of the 2b subtype was diluted and
screened with radiolabeled ZC4519 and ZC4520 (Table 4).
Phage that hybridize to both radiolabeled ZC4519 and
ZC4520 are picked, eluted, diluted, plated and rescreened
with the oligonucleotide probes. The screening is
repeated until a pure clone is obtained. The pure clone
is sequenced, and a full-length clone is constructed
using the most convenient restriction enzyme(s).

Based on an alignment of the deduced amino acid
sequences of subtypes 1a and 2a, strategies were designed
for cloning additional subtypes using PCR amplification.
Degenerate oligonucleotide families were prepared to
encode conserved amino acid sequences in the sixth
transmembrane domain, a region surrounding the conserved
amino acid sequence Phe-Asp-Glu-Lys, the third
cytoplasmic loop, and the second transmembrane domain
(Table 4).

Glutamate receptor cDNA sequences were amplified
with pairs of degenerate primers from Table 4 using the
PCR method on cDNA from the total rat brain library, the
cDNA from the rat cerebellum library, a rat cortex cDNA
library or a rat hippocampus cDNA library (both obtained
from Michael Brownstein, National Institutes of Health,
Bethesda, MD). The primers also each contained a 5' tail
of 10 nucleotides, which provided convenient restriction
enzyme sites. For each PCR reaction, ten nanograms from
the library and 100 pmol of the oligonucleotide pools
ZC4563 and ZC4560 (Table 4) were combined in a reaction
volume of 50 µl containing 50 mM KCl, 10 mM Tris-HCl pH
9.0, 1.5 mM MgCl2, 0.1% Triton X-100, 0.01% gelatin, 0.2
mM each deoxynucleotide triphosphate and 2.5 units of Taq
DNA polymerase. The reaction mixture was overlaid with
mineral oil. After five cycles (30 seconds at 94°C, 30
seconds at 45°C, 1 minute at 50°C) and twenty-five cycles
(30 seconds at 94°C, 30 seconds at 45°C, 1 minute at
72°C) the amplified DNA was removed for analysis.

An aliquot from each reaction was electrophoresed
on an agarose gel. Southern analysis of the gel was
performed using essentially the method described by
Sambrook et al. (ibid.) and random-primed fragments
covering the entire coding regions from both the subtype
1a and 2a clones. The autoradiographs showed that the
PCR reaction generated fragments of novel size that were
different from either the 1a or 2a subtype. The
PCR-generated fragments were electrophoresed on an
agarose gel. Regions corresponding to the unique-sized
receptor-related products were excised and
electrophoresed onto NA45 paper (Schleicher and Schuell,
Keene, NH). The purified fragments were recovered using
essentially the method described by the manufacturer,
digested with Eco RI and ligated to plasmid pVEGT' that
had been linearized by digestion with Eco RI and treated
with phosphatase to prevent recircularization. The
ligation mixtures were transformed into E. coli strain
DH10B cells. Transformants were picked and replica
plated onto nitrocellulose filters and screened using
random-primed probes from the 1a and the 2a clones.
Forty-eight colonies were picked for restriction analysis
and sequencing.

DNA sequences from the cDNA from the total rat 
brain library and the cDNA from the rat cerebellum
library were each amplified and analyzed using the
methods described above and oligonucleotide ZC4559 in
combination with either ZC4561 or ZC4559 (Table 4).

A rat cortex cDNA library and a rat hippocampus
cDNA library (both obtained from Michael Brownstein, NIH)
are subdivided into 30 pools of 10,000 colonies. Plasmid
DNA is prepared from each pool, and the DNA is subjected
to Southern analysis after restriction digestion of the
pools with Bam HI and Xho I or by PCR amplification of
each pool using the degenerate oligonucleotides of Table
4. The library pools containing DNA that hybridizes to
the probes and appears to contain a full-length cDNA are
subdivided. The plasmid DNA is prepared and screened as
described above. Positive pools are again divided and
the procedure is continued until the pool is reduced to
pure clones. The clones are subjected to restriction
analysis and partial sequence analysis. Clones that
represent distinct glutamate receptor homologs are
completely sequenced. Full length clones are generated
by subjecting the original pools to PCR amplification
using an oligonucleotide primer specific to the SP6
promoter at the 5' end of the cDNA insert and an
antisense oligonucleotide primer corresponding to the 5'
end of the most complete cDNA to identify pools that
contain the longest glutamate receptor homolog cDNA. The
pool is then diluted and rehybridized with the probes as
described above to isolate a full length cDNA clone.

Expression of Glutamate Receptor Subtypes

Complementary DNA sequences encoding subtypes 1b
and 2a were subcloned first into the mammalian expression
vector Zem228R to obtain convenient terminal restriction
sites. The cDNAs were then subcloned into pVEGT'. The
cDNA sequence encoding subtype 1b was constructed by
replacing the 3' terminal portion of subtype 1a described
in Example I with the analogous portion of subtype 1b
from SN23. Plasmid SN23 was digested with Kpn I and Xba
I to isolate the fragment containing the 3' terminus of
the 1b subtype. The plasmid containing the subtype 1a
coding sequence (45-A) in Zem228R was digested with Kpn I
and Xba I to isolate the vector-containing fragment. The
vector-containing fragment was ligated to the Kpn I-Xba I
fragment from SN23. The resulting plasmid comprises the
MT-1 promoter, the subtype 1b cDNA and the hGH
terminator. This plasmid was transfected into the BHK
570 cell line essentially as described in Example I to
obtain stably transfected cell lines expressing the
subtype 1b receptor. The subtype 1b cDNA fragment was
isolated as a Bam HI fragment, which was ligated with
pVEGT' that had been linearized with Bam HI. A plasmid
containing the cDNA sequence in the correct orientation
was used to synthesize RNA in an in vitro system. The
RNA was injected into oocytes as described above.

Plasmid SN30, which comprises the subtype 2a cDNA,
was digested with Eco RI to isolate the subtype 2a cDNA.
The Eco RI fragment was ligated with Eco RI-linearized
Zem228R. A plasmid containing the insert in the correct
orientation was digested with Bam HI to isolate the cDNA
sequence. The Bam HI fragment comprising the subtype 2a
cDNA was ligated with Eco RI-linearized pVEGT'. A
plasmid containing the cDNA in the correct orientation
was used to synthesize RNA in an in vitro transcription
reaction. The RNA was injected into frog oocytes as
described above.

EXAMPLE III

Generation of antibodies to glutamate receptor subtypes

Receptor subtype-specific polyclonal antisera
were generated in rabbits using standard immunization
techniques. Synthetic peptides (Table 5) were designed
from the cloned receptor sequences. The peptides were
conjugated to keyhole limpet hemocyanin, and each antigen
was used to immunize two animals. For each peptide, the
animals were injected with 100-200 µg of conjugated
peptide divided among three subcutaneous sites. The
animals were immunized at three-week intervals and bled
via an ear vein 10 days after the third and subsequent
immunizations.



Table 5

Subtype   Seq. ID No.   Peptide Sequence      Apparent Location

1a        21            RDSLISIRDEKDGLNRC     extracellular
          22            DRLLRKLRERLPKARV      extracellular
          23            EEWFDEKGDAPGRYD       extracellular
          24            EFVYEREGNTEEDEL       cytoplasmic
          25            PERKCCEIREQYGIQRV     extracellular
          26            IGPGSSSVAIQVQNLL      extracellular
          27            IAYSATSIDLSDKTL       extracellular
1b        28            KKPGAGNAKKRQPEFS      cytoplasmic
          29            PEFSPSSQCPSAHAQL      cytoplasmic
2a        30            DKIIKRLLETSNARG       extracellular
          31            VNFSGIAGNPVTFNEN      extracellular
          32            GEAKSELCENLETPAL      cytoplasmic
2b        33            PARLALPANDTEFSAWV     cytoplasmic

Anti-peptide antibodies were purified by affinity
purification using the Proton™ Kit (Multiple Peptide
Systems, San Diego, CA). Purified antibodies were stored
in column elution buffer and neutralizing buffer
(supplied by Multiple Peptide Systems). Bovine serum
albumin was added to a concentration of 1 mg/ml, and
sodium azide was added to a concentration of 0.05%. The
antibodies were stored at 4°C or in small aliquots at
-20°C.

Antibodies generated from the peptides listed in 
Table 6 were used to detect G protein-coupled glutamate 
receptors by Western blot analysis of membranes prepared 
from transfected cell lines that were stably expressing 
the subtype la or subtype lb receptors. Control cell 
lines were transfected with vector alone. 

Table 6 

Analysis of Antibodies Raised to Peptides

Antibodies to
Peptide Sequence     Seq. ID No.   Location        Western

RDSLISIRDEKDGLNRC    21            extracellular   +++ with bkgd
DRLLRKLRERLPKARV     22            extracellular   +
EEVWFDEKGDAPGRYD     23            extracellular   ++++ low bkgd
EFVYEREGNTEEDEL      24            cytoplasmic     ++++ low bkgd
KKPGAGNAKKRQPEFS     28            cytoplasmic     + for 1a
                                                   - for 1b
PEFSPSSQCPSAHAQL     29            cytoplasmic     +++ for 1b low bkgd

Transfectants that were stably expressing either
the 1a or 1b subtype were each grown to confluency in
five to ten 150 mm plates. Each plate was first washed
twice with 15 ml of cold PBS and then 20 ml of ice cold
10 mM NaHCO3 was added to each plate. The cells from
each plate were scraped off the plates with a rubber
spatula and transferred to a glass dounce homogenizer on
ice. The cells were disrupted with ten strokes of the B
pestle. The homogenates from each plate were combined
and centrifuged for thirty minutes at 3000 rpm at 4°C.
The pellets were resuspended in 4-8 ml of 10 mM NaHCO3
using a 22 g needle and syringe, and 69% sucrose was
added (6-12 ml) to each sample until an index of
refraction of 1.410 was reached. The samples were
transferred to a high speed centrifugation tube, and each
sample was overlaid with 42% sucrose. The samples were
centrifuged for two hours at 25,000 rpm at 4°C. The
samples were collected by gently floating the membranes
off the 42% sucrose layer by adding 1 ml of 10 mM NaHCO3
and resuspending the membranes by carefully stirring the
upper layer. The upper layer was transferred to a fresh
tube on ice. The purified membranes were centrifuged at
10,000 rpm at 4°C and the pellets resuspended in 10 mM
NaHCO3. The purified membranes were then adjusted to a
final protein concentration of 1-2 µg/ml.

Ten to twenty micrograms of each purified membrane 
preparation were diluted with 2x SDS-mercaptoethanol 
buffer (100 mM Tris HCl (pH 6.8), 200 mM dithiothreitol, 
4% SDS, 0.2% bromophenol blue, 20% glycerol). The 
samples were incubated for 15 minutes at 37°C followed by 
boiling for 5 minutes. The samples were subjected to 
SDS-PAGE on a 4-15% gradient gel. The samples were 
electrotransferred to nitrocellulose using the method 
essentially described by Towbin (Proc. Natl. Acad. Sci. 
USA 76: 4350-4354, 1979, which is incorporated herein by 
reference in its entirety). After transfer, the 
nitrocellulose was cut into strips such that each strip 
contained a control and receptor samples. The 
nitrocellulose was preincubated in blocking buffer and 
then incubated with a dilution of either the preimmune 
serum or the serum collected after antigenic stimulation 
(sera from later bleeds, i.e. those after four antigen 
stimulations, were diluted 1:1500). After washing, a 
horseradish peroxidase-conjugated goat anti-rabbit 
antibody (Bio-Rad Laboratories, Richmond, CA) diluted 
1:2,500 was added and, after incubation and washing, the 
horseradish peroxidase substrate (Bio-Rad Laboratories) 
was added and the color reaction was initiated. The 
reaction was stopped by rinsing the filters in distilled 
water. Table 6 shows the results of the Western blot 
analysis. 
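
The serum and conjugate dilutions quoted above (1:1500 and 1:2,500) translate into pipetting volumes once an incubation volume is chosen. The sketch below is illustrative only: it assumes that "1:N" means one part stock in N parts final volume, and the 10 ml incubation volume and the function name `stock_volume_ul` are hypothetical values not taken from the text.

```python
def stock_volume_ul(final_volume_ml: float, dilution_factor: float) -> float:
    """Microlitres of stock needed for a 1:dilution_factor dilution.

    Assumes 1:N means one part stock in N parts of final volume.
    """
    return final_volume_ml * 1000.0 / dilution_factor


FINAL_VOLUME_ML = 10.0  # hypothetical incubation volume per strip

for label, factor in (("antiserum, later bleeds", 1500),
                      ("HRP-conjugated goat anti-rabbit", 2500)):
    volume = stock_volume_ul(FINAL_VOLUME_ML, factor)
    print(f"{label}: {volume:.2f} ul stock in {FINAL_VOLUME_ML:.0f} ml")
```

If the laboratory convention is instead one part stock plus N parts diluent, the divisor becomes N + 1; at these dilution factors the difference is well under one percent.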



SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Mulvihill, Eileen R. 
Hagen, Frederick S. 
Houamed, Khaled M. 
Almers, Wolfhard 

(ii) TITLE OF INVENTION: G PROTEIN-COUPLED GLUTAMATE RECEPTORS 

(iii) NUMBER OF SEQUENCES: 33 

(iv) CORRESPONDENCE ADDRESS: 
(A) ADDRESSEE: Townsend and Townsend 
(B) STREET: One Market Plaza, Steuart Street Tower 
(C) CITY: San Francisco 
(D) STATE: California 
(E) COUNTRY: USA 
(F) ZIP: 94105-1492 

(v) COMPUTER READABLE FORM: 
(A) MEDIUM TYPE: Floppy disk 
(B) COMPUTER: IBM PC compatible 
(C) OPERATING SYSTEM: PC-DOS/MS-DOS 
(D) SOFTWARE: PatentIn Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 
(A) APPLICATION NUMBER: 
(B) FILING DATE: 
(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 
(A) APPLICATION NUMBER: US 07/672,007 
(B) FILING DATE: 18-MAR-1991 

(vii) PRIOR APPLICATION DATA: 
(A) APPLICATION NUMBER: US 07/648,481 
(B) FILING DATE: 30-JAN-1991 

(vii) PRIOR APPLICATION DATA: 
(A) APPLICATION NUMBER: US 07/626,806 
(B) FILING DATE: 12-DEC-1990 

(viii) ATTORNEY/AGENT INFORMATION: 
(A) NAME: Parmelee, Steven W. 
(B) REGISTRATION NUMBER: 31,990 
(C) REFERENCE/DOCKET NUMBER: 13952-6PC 

(ix) TELECOMMUNICATION INFORMATION: 
(A) TELEPHONE: (206) 467-9600 
(B) TELEFAX: (415) 543-5043 

(2) INFORMATION FOR SEQ ID NO:1: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 4300 base pairs 
(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: 45-A 

(ix) FEATURE: 
(A) NAME/KEY: CDS 
(B) LOCATION: 377..3973 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1: 
CCGAGAACGG CTGCAGTCCT CTGACCTGAG ACCAATAGCT GTGTCTACCC GGACTCAGCG 
TCCAGCTCAC CGCCACTAAC GCGCCGCGCA TTGGACACCT GATCCACACA CCTTCGGGCA 
CCAGTGAAAA ACCGCGACTT GATTTTCTGG AAGAACGCCC CCAGGGTGTG GGAGCGGTCG 
TGGAGGACCA GCAGGAGGAA GCGGAGGGGA GAGGGGCAGT AGTGGAGGCA GAGAAAGCGT 
TGAACCAGCT GTGTTGGCCG AAGGCACGAA ACGGCAAAAG GCAGCGGTGA GCATCTGTGT 
OCTTCCCGCT GGGAACCTGC AGGCAGGACC GGCGTGGGAA CGTGGCTGGC CCGCGGTGGA 
CCGCGTCTTC GCCACA ATG GTC CGG CTC CTC TTG ATT TTC TTC CCA ATG 

% S2 SB S5 2 82 a S5 -i '25 JJ 

30 35 

s a s s s a"s= is a a - £ ss « »' " 

45 

as a s s s s 5 s «. s « " - 1 

60 65 

«c a. M «. «c sec » «e » « %l «• « £ .S 

He Gin Arg Val Glu Ala Met Phe His Tnr «*u 9Q 

80 , 

^ nnn ATC ACT CTG GGC AGT GAG ATC CGG GAC 

GAC CCG GTG CTC CTG CCC AAC ATC ACT CXG ^ ne Arg Afjp 

Asp Pro Val Leu Leu Pro Asn lie Tnr unu » i 




60 
120 
180 
240 
300 
360 
409 

457 

505 

553 

601 

649 

697 

100 105 



TCC TGC TGG CAC TCT TCA GTG GCT CTC GAA CAG AGC ATC GAA TTC ATC 
ser 5s S-2S sS ser val Ala Leu Glu Gin Ser lie Glu Phe lie 
110 H5 ■ 

2 £ 22 S £ S.SSa § $ - - . ~ 



125 130 i35 



745 



793 



841 



889 



TGC CTG CCT GAT GGC CAG ACC CTG CCC CCT GGC AGG ACT AAG AAG CCT 

Cyt 5u Wo Sp Si? Gin Thr Leu Pro Pro Gly Arg Thr Lys Lys Pro 

140 

ATT GCT GGA GTG ATC GGC CCT GGC TCC AGC TCT GTG GCC ATT CAA GTC 
iie Ala Gly val lie Gly Pro Gly Ser Ser Ser Val Ala lie Gin Val 

160 I 65 
CAG AAT CTT CTC CAG CTG TTC GAC ATC CCA CAG ATC GCC TAT TCT GCC 937 

£n Hn 2u 2u Gin Su Phe Asp lie Pro Gin lie Ala Tyr Ser Ala 
^75 180 AO:> 

ACA AGC ATA GAC CTG AGT GAC AAA ACT TTG TAC AAA TAC TTC CTG AGG 
?S Ser IS Asp Leu Ser Asp Lys Thr Leu Tyr Lys. Tyr Phe Leu Arg 

190 195 :• <suw 

GTG CTC CCT TCT GAC ACT TTG CAG GCA AGG GCG ATG CTC GAC ATA GTC 

SS SI? sS £p Thr Leu Gin Ala Arg Ala Met Leu Asp lie val 
205 210 215 

%%%% gs a a s a a a ss« a 

220 225 230 

s a a s a a a si a a a a a a a a 

240- 245 

S 2 : S SS S2 5 2?."g K a a .25-g.SS a 

Vaa as ass a a a« 5 s a a.. 

270 275 
GCC AGG GTT GTG GTC TGC TTC TGC GAG GGC ATG ACA GTG CGG GGC TTA 

Ala Arg Val val Val cys Phe Cys Glu Gly Met Tlu val Arg e-y i«u 

285 290 • 

a s a a a-s a .a a" a 'g' s?'a 2 a a: 

300 305 

a a a ftsass-sssya s I- 

320 323 ^ 




985 



1033 



1081 



1129 



1177 



1225 



1273 



1321 



1369 
s a a a a ss-s a g a s a s s a s 

335 340 



~ ^U'-nm, r»T TAG TTC CTG AAG CTG AGG CTG GAC ACC AAC ACA 

g22 asp Sp V£ » 2u Ly, L« .u. Asp Thr Asn T*r 

350 355 

SBS si a as as| sss a 

aa a Wa a a aa a a a a 5« 

38S 



380 

^ ias sss aagt: a a. a ga 

400 4UD 

sssKSss ga a a § a a 

430' ■ 435 

as a a a a a aa a s a £'a sa 

445 450 



CCC «A i« g GAC ATT » »T =TG =»= *e ACA COT A« gc 

Pro Gly Arg Tyr Asp lie mm Asn w« jj« *jt 4g0 

. 480 

- a-s a*aa'a s g&s ss s a .a 

495 

- a a a s a e aa aa. a a ».=;= 



510 5" 



5s.ss a a a- a a s a I s a. a a. 



525 530 



W GTG . AGC TGC TGC TGG ATC TOO ACG GCC TGC AAA GAG AAT GAG TTT 

Glu val ser cys cys Trp lie cys Tnr Aia .uyj * 555 



540 545 



OK CAG GAC GAG TTC ACC TGC AGA GCC TGT GAC CTG GGG TGG TGG CCC 




1417 



1465 



1513 



1561 



1609 



1657 



1705 



1753 



1801 



1849 



1897 



1945 



1993 



2041 



2089 
Val Gln Asp Glu Phe Thr Cys Arg Ala Cys Asp Leu Gly Trp Trp Pro 
                560                 565                 570 

AAC GCA GAG CTC ACA GGC TGT GAG CCC ATT CCT GTC CGT TAT CTT GAG    2137 
Asn Ala Glu Leu Thr Gly Cys Glu Pro Ile Pro Val Arg Tyr Leu Glu 
            575                 580                 585 

TGG AGT GAC ATA GAA TCT ATC ATA GCC ATC GCC TTT TCT TGC CTG GGC    2185 
Trp Ser Asp Ile Glu Ser Ile Ile Ala Ile Ala Phe Ser Cys Leu Gly 
        590                 595                 600 

ATC CTC GTG ACG CTG TTT GTC ACC CTC ATC TTC GTT CTG TAC CGG GAC    2233 
Ile Leu Val Thr Leu Phe Val Thr Leu Ile Phe Val Leu Tyr Arg Asp 
    605                 610                 615 

ACA CCC GTG GTC AAA TCC TCC AGT AGG GAG CTC TGC TAT ATC ATP CTG    2281 
Thr Pro Val Val Lys Ser Ser Ser Arg Glu Leu Cys Tyr Ile Ile Leu 
620                 625                 630                 635 

GCT GGT ATT TTC CTC GGC TAT GTG TGC CCT TTC ACC CTC ATC GCC AAA    2329 
Ala Gly Ile Phe Leu Gly Tyr Val Cys Pro Phe Thr Leu Ile Ala Lys 
                640                 645                 650 

CCT ACT ACC ACA TCC TGC TAC CTC CAG CGC CTC CTA GTT GGC CTC TCT    2377 
Pro Thr Thr Thr Ser Cys Tyr Leu Gln Arg Leu Leu Val Gly Leu Ser 
            655                 660                 665 

TCT GCC ATG TGC TAC TCT GCT TTA GTG ACC AAA ACC AAT CGT ATT GCA    2425 
Ser Ala Met Cys Tyr Ser Ala Leu Val Thr Lys Thr Asn Arg Ile Ala 
        670                 675                 680 

CGC ATC CTG GCT GGC AGC AAG AAG AAG ATC TGC ACC CGG AAG CCC AGA    2473 
Arg Ile Leu Ala Gly Ser Lys Lys Lys Ile Cys Thr Arg Lys Pro Arg 
    685                 690                 695 



TTC ATG AGC GCT TGG GCC CAA GTG ATC ATA GCC TCC ATT CTG ATT AGT    2521 
Phe Met Ser Ala Trp Ala Gln Val Ile Ile Ala Ser Ile Leu Ile Ser 
700                 705                 710                 715 

GTA CAG CTA ACA CTA GTG GTG ACC TTG ATC ATC ATG GAG CCT CCC ATG    2569 
Val Gln Leu Thr Leu Val Val Thr Leu Ile Ile Met Glu Pro Pro Met 
                720                 725                 730 

CCC ATT TTG TCC TAC CCG AGT ATC AAG GAA GTC TAC CTT ATC TGC AAT    2617 
Pro Ile Leu Ser Tyr Pro Ser Ile Lys Glu Val Tyr Leu Ile Cys Asn 
            735                 740                 745 

*m »rr AAC CTG GGT GTA GTG GCC CCT GTG GGT TAC AAT GGA CTC CTC    2665 
Thr Ser Asn Leu Gly Val Val Ala Pro Val Gly Tyr Asn Gly Leu Leu 
        750                 755                 760 

ATC ATG AGC TGT ACC TAC TAT GCC TTC AAG ACC CGC AAC GTG CCG GCC    2713 
Ile Met Ser Cys Thr Tyr Tyr Ala Phe Lys Thr Arg Asn Val Pro Ala 
    765                 770                 775 

AAC TTC AAT GAG GCT AAA TAC ATC GCC TTC ACC ATG TAC ACT ACC TGC    2761 
Asn Phe Asn Glu Ala Lys Tyr Ile Ala Phe Thr Met Tyr Thr Thr Cys 
780                 785                 790                 795 

ATC ATC TGG CTG GCT TTC GTT CCC ATT TAC TTT GGG AGC AAC TAC AAG    2809 
Ile Ile Trp Leu Ala Phe Val Pro Ile Tyr Phe Gly Ser Asn Tyr Lys 
                800                 805                 810 

ATC ATC ACT ACC TGC TTC GCG GTG AGC CTC AGT GTG ACG GTG GCC CTG    2857 
Ile Ile Thr Thr Cys Phe Ala Val Ser Leu Ser Val Thr Val Ala Leu 
            815                 820                 825 

GGG TGC ATG TTT ACT CCG AAG ATG TAC ATC ATC ATT GCC AAA CCT GAG    2905 
Gly Cys Met Phe Thr Pro Lys Met Tyr Ile Ile Ile Ala Lys Pro Glu 
        830                 835                 840 

AGG AAC GTC CGC AGT GCC TTC ACG ACC TCT GAT GTT GTC CGC ATG CAC    2953 
Arg Asn Val Arg Ser Ala Phe Thr Thr Ser Asp Val Val Arg Met His 
    845                 850                 855 

GTC GGT GAT GGC AAA CTG CCG TGC CGC TCC AAC ACC TTC CTC AAC ATT    3001 
Val Gly Asp Gly Lys Leu Pro Cys Arg Ser Asn Thr Phe Leu Asn Ile 
860                 865                 870                 875 

TTC CGG AGA AAG AAG CCC GGG GCA GGG AAT GCC AAT TCT AAC GGC AAG    3049 
Phe Arg Arg Lys Lys Pro Gly Ala Gly Asn Ala Asn Ser Asn Gly Lys 
                880                 885                 890 

TCT GTG TCA TGG TCT GAA CCA GGT GGA AGA CAG GCG CCC AAG GGA CAG    3097 
Ser Val Ser Trp Ser Glu Pro Gly Gly Arg Gln Ala Pro Lys Gly Gln 
            895                 900                 905 

CAC GTG TGG CAG CGC CTC TCT GTG CAC GTG AAG ACC AAC GAG ACG GCC    3145 
His Val Trp Gln Arg Leu Ser Val His Val Lys Thr Asn Glu Thr Ala 
        910                 915                 920 

TGT AAC CAA ACA GCC GTA ATC AAA CCC CTC ACT AAA AGT TAC CAA GGC    3193 
Cys Asn Gln Thr Ala Val Ile Lys Pro Leu Thr Lys Ser Tyr Gln Gly 
    925                 930                 935 

TCT GGC AAG AGC CTG ACG TTT TCA GAT GCC AGC ACC AAG ACC CTT TAC    3241 
Ser Gly Lys Ser Leu Thr Phe Ser Asp Ala Ser Thr Lys Thr Leu Tyr 
940                 945                 950                 955 

AAT GTG GAA GAA GAG GAC AAT ACC CCT TCT GCT CAC TTC AGC CCT CCC    3289 
Asn Val Glu Glu Glu Asp Asn Thr Pro Ser Ala His Phe Ser Pro Pro 
                960                 965                 970 

AGC AGC CCT TCT ATG GTG GTG CAC CGA CGC GGG CCA CCC GTG GCC ACC    3337 
Ser Ser Pro Ser Met Val Val His Arg Arg Gly Pro Pro Val Ala Thr 
            975                 980                 985 

ACA CCA CCT CTG CCA CCC CAT CTG ACC GCA GAA GAG ACC CCC CTG TTC    3385 
Thr Pro Pro Leu Pro Pro His Leu Thr Ala Glu Glu Thr Pro Leu Phe 
        990                 995                 1000 

CTG GCT GAT TCC GTC ATC CCC AAG GGC TTG CCT CCT CCT CTC CCG CAG    3433 
Leu Ala Asp Ser Val Ile Pro Lys Gly Leu Pro Pro Pro Leu Pro Gln 
    1005                1010                1015 
gs a a as £sss a.ga a a a £ 
a a a a a a a a a. a a a a a ajs- 

1040 iu« 

S S SSfJS ss a a ssa a g ss i ss - 

a as as a a gs ss'ss a ssa a a 

1070 1075 

a i .a a a;a a aa a a a is s a ss 
■a a as s ss a a a a &a a a a a, 

1100 1105 

a a a a a a a a a a a s -g a o a 

1120 1123 

si* s ssssias s ; 

CCT GCC CTG ACG CCT CCT TCT CCT TTC CGA GAT TCC GTG GCC TCT GGC 
Pro Ala Leu Thr Pro Pro Ser Pro Phe Arg Asp Ser Val Ala Ser Gly 
        1150                1155                1160 

a a aaa a a a a a a ra .a. a a 

1165 1170 1X7 ' 

CCA AAT GTA ACC TAC GCC TCT GTC ATT CTG AGG GAC TAC AAG CAA AGC 
Pro Asn Val Thr Tyr Ala Ser Val Ile Leu Arg Asp Tyr Lys Gln Ser 
1180                1185                1190                1195 

TCT TCC ACC CTG TAGTGTGTGT GTGTGTGTGG GGGCGGGGGG AGTGCGCATG 
Ser Ser Thr Leu 

GAGAAGCCAG AGATGCCAAG GAGTGTCAAC CCTTCCAGAA ATGTGTAGAA AGCAGGGTGA 
GGGATGGGGA TGGAGGACCA CGGTCTGCAG GGAAGAAAAA AAAAATGCTG CGGCTGCCTT 
AAAGAAGGAG AGGGACGATG CCAACTGAAC AGTGGTCCTG GCCAGGATTG TGACTCTTGA 
ATTATTCAAA AACCTTCTCT AGAAAGAAAG GGAATTATGA CAAAGCACAA TTCCATATGG 
TATGTAACTT TTATCGAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAA 



3481 

3529 

3577 

3625 

3673 

3721 

3769 

3817 

3865 

3913 

3961 

4013 

4073 
4133 
4193 
4253 
4300 
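
The coordinates stated for SEQ ID NO:1 (CDS at 377..3973) can be cross-checked against the 1199-residue length stated for SEQ ID NO:2 and against the right-margin position numbers of the listing. The following is a minimal sketch under two assumptions read off the listing itself: the first coding line carries 11 codons and every later full line carries 16. The constant and function names are hypothetical.

```python
CDS_START, CDS_END = 377, 3973   # FEATURE entry of SEQ ID NO:1
PROTEIN_LENGTH = 1199            # LENGTH entry of SEQ ID NO:2
FIRST_LINE_CODONS = 11           # assumption: codons on the first coding line
CODONS_PER_FULL_LINE = 16        # assumption: codons on each later full line


def cds_length(start: int, end: int) -> int:
    """Number of nucleotides in an inclusive coordinate range."""
    return end - start + 1


def line_end_position(line_index: int) -> int:
    """Position of the last nucleotide on coding line `line_index` (0-based)."""
    codons_so_far = FIRST_LINE_CODONS + CODONS_PER_FULL_LINE * line_index
    return CDS_START - 1 + 3 * codons_so_far


codons, remainder = divmod(cds_length(CDS_START, CDS_END), 3)
print(codons, remainder)       # whole codons and leftover nucleotides
print(line_end_position(0))    # compare with the first coding-line number
print(line_end_position(74))   # compare with the last full coding-line number
```

Because the range divides into exactly 1199 codons, the stated CDS appears to exclude the TAG stop codon that follows position 3973; this is an inference from the arithmetic, not a statement made in the listing.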
(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1199 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Val Arg Leu Leu Leu Ile Phe Phe Pro Met Ile Phe Leu Glu Met 
1               5                   10                  15 

Ser Ile Leu Pro Arg Met Pro Asp Arg Lys Val Leu Leu Ala Gly Ala 
            20                  25                  30 

s. r s.r Gin Ar, S« v.1 Al. »P W »• *S "* Sly 

U. Leu Phe «r V.1 HI. Hi. A..*. ■* «» «• *" G1U 

50 55 

Ar, Ly. cy. sly siu x* «, siu «m Tyr sly n» «» ™ % 
Z mt Ph. hi. Thr U «p w »-■ w"*" v * 1 1, 

Pre A» U. Thr Z sly *« «» «• *» " P S " ^ SS " U 

100 109 

ser vl Al. «u siu sin «r II; «-• « »• 3 S " ^ 

115 120 

s« u. a* «p oiu Ly. as, «iv u.-Wm gj i- »> 817 

«, £ ym. Pro Pro Sly Ar,- Thr Ly. Ly. Pro II. Al. Sly V.1 II. 



145 ^ "° 



01, Pro sly s.r S.r S.r v.1 Al. II. g. VI «- «» - g» «" 



^_ ti« Ala Tvr Ser Ala Thr Ser lie Asp Leu 

Leu Phe Asp lie Pro Gin He Ala Tyr iga 

180 1B3 
ser Asp Ly. Thr «. xyr Ly. Tyr Ph. Leu Ar, v.1 v.1 Pro s« A., 
195 * ow 

, a« Ala Met Leu Asp He Val Lys Arg Tyr Asn Trp 

Thr Leu Gin Ala Arg Ala 22Q 

210 215 

^ ^ m s r Al. v.i His Thr sl» Sly J- Tyr sly siu s.r Sly 
Met Asp Ala Phe Lys Glu Leu Ala Ala Gln Glu Gly Leu Cys Ile Ala 
                245                 250                 255 

His Ser Asp Lys Ile Tyr Ser Asn Ala Gly Glu Lys Ser Phe Asp Arg 
            260                 265                 270 

Leu Leu Arg Lys Leu Arg Glu Arg Leu Pro Lys Ala Arg Val Val Val 
        275                 280                 285 

Cys Phe Cys Glu Gly Met Thr Val Arg Gly Leu Leu Ser Ala Met Arg 
    290                 295                 300 

Arg Leu Gly Val Val Gly Glu Phe Ser Leu Ile Gly Ser Asp Gly Trp 
305                 310                 315                 320 

Ala Asp Arg Asp Glu Val Ile Glu Gly Tyr Glu Val Glu Ala Asn Gly 
                325                 330                 335 

Gly Ile Thr Ile Lys Leu Gln Ser Pro Glu Val Arg Ser Phe Asp Asp 
            340                 345                 350 

Tyr Phe Leu Lys Leu Arg Leu Asp Thr Asn Thr Arg Asn Pro Trp Phe 
        355                 360                 365 

Pro Glu Phe Trp Gln His Arg Phe Gln Cys Arg Leu Pro Gly His Leu 
    370                 375                 380 

Leu Glu Asn Pro Asn Phe Lys Lys Val Cys Thr Gly Asn Glu Ser Leu 
385                 390                 395                 400 

Glu Glu Asn Tyr Val Gln Asp Ser Lys Met Gly Phe Val Ile Asn Ala 
                405                 410                 415 

Ile Tyr Ala Met Ala His Gly Leu Gln Asn Met His His Ala Leu Cys 
            420                 425                 430 

Pro Gly His Val Gly Leu Cys Asp Ala Met Lys Pro Ile Asp Gly Arg 
        435                 440                 445 

Lys Leu Leu Asp Phe Leu Ile Lys Ser Ser Phe Val Gly Val Ser Gly 
    450                 455                 460 

Glu Glu Val Trp Phe Asp Glu Lys Gly Asp Ala Pro Gly Arg Tyr Asp 
465                 470                 475                 480 

Ile Met Asn Leu Gln Tyr Thr Glu Ala Asn Arg Tyr Asp Tyr Val His 
                485                 490                 495 

Val Gly Thr Trp His Glu Gly Val Leu Asn Ile Asp Asp Tyr Lys Ile 
            500                 505                 510 

Gln Met Asn Lys Ser Gly Met Val Arg Ser Val Cys Ser Glu Pro Cys 
        515                 520                 525 

Leu Lys Gly Gln Ile Lys Val Ile Arg Lys Gly Glu Val Ser Cys Cys 
    530                 535                 540 

Trp Ile Cys Thr Ala Cys Lys Glu Asn Glu Phe Val Gln Asp Glu Phe 
545                 550                 555                 560 

Thr Cys Arg Ala Cys Asp Leu Gly Trp Trp Pro Asn Ala Glu Leu Thr 
                565                 570                 575 

Gly Cys Glu Pro Ile Pro Val Arg Tyr Leu Glu Trp Ser Asp Ile Glu 
            580                 585                 590 

Ser Ile Ile Ala Ile Ala Phe Ser Cys Leu Gly Ile Leu Val Thr Leu 
        595                 600                 605 

Phe Val Thr Leu Ile Phe Val Leu Tyr Arg Asp Thr Pro Val Val Lys 
    610                 615                 620 

Ser Ser Ser Arg Glu Leu Cys Tyr Ile Ile Leu Ala Gly Ile Phe Leu 
625                 630                 635                 640 

Gly Tyr Val Cys Pro Phe Thr Leu Ile Ala Lys Pro Thr Thr Thr Ser 
                645                 650                 655 

Cys Tyr Leu Gln Arg Leu Leu Val Gly Leu Ser Ser Ala Met Cys Tyr 
            660                 665                 670 

Ser Ala Leu Val Thr Lys Thr Asn Arg Ile Ala Arg Ile Leu Ala Gly 
        675                 680                 685 

Ser Lys Lys Lys Ile Cys Thr Arg Lys Pro Arg Phe Met Ser Ala Trp 
    690                 695                 700 

Ala Gln Val Ile Ile Ala Ser Ile Leu Ile Ser Val Gln Leu Thr Leu 
705                 710                 715                 720 

Val Val Thr Leu Ile Ile Met Glu Pro Pro Met Pro Ile Leu Ser Tyr 
                725                 730                 735 

Pro Ser Ile Lys Glu Val Tyr Leu Ile Cys Asn Thr Ser Asn Leu Gly 
            740                 745                 750 

Val Val Ala Pro Val Gly Tyr Asn Gly Leu Leu Ile Met Ser Cys Thr 
        755                 760                 765 

Tyr Tyr Ala Phe Lys Thr Arg Asn Val Pro Ala Asn Phe Asn Glu Ala 
    770                 775                 780 

Lys Tyr Ile Ala Phe Thr Met Tyr Thr Thr Cys Ile Ile Trp Leu Ala 
785                 790                 795                 800 

Phe Val Pro Ile Tyr Phe Gly Ser Asn Tyr Lys Ile Ile Thr Thr Cys 
                805                 810                 815 

Phe Ala Val Ser Leu Ser Val Thr Val Ala Leu Gly Cys Met Phe Thr 
            820                 825                 830 

Pro Lys Met Tyr Ile Ile Ile Ala Lys Pro Glu Arg Asn Val Arg Ser 
        835                 840                 845 

Ala Phe Thr Thr Ser Asp Val Val Arg Met His Val Gly Asp Gly Lys 
    850                 855                 860 

Leu Pro Cys Arg Ser Asn Thr Phe Leu Asn Ile Phe Arg Arg Lys Lys 
865                 870                 875                 880 

Pro Gly Ala Gly Asn Ala Asn Ser Asn Gly Lys Ser Val Ser Trp Ser 
                885                 890                 895 

Glu Pro Gly Gly Arg Gln Ala Pro Lys Gly Gln His Val Trp Gln Arg 
            900                 905                 910 

Leu Ser Val His Val Lys Thr Asn Glu Thr Ala Cys Asn Gln Thr Ala 
        915                 920                 925 

Val Ile Lys Pro Leu Thr Lys Ser Tyr Gln Gly Ser Gly Lys Ser Leu 
    930                 935                 940 

Thr Phe Ser Asp Ala Ser Thr Lys Thr Leu Tyr Asn Val Glu Glu Glu 
945                 950                 955                 960 

Asp Asn Thr Pro Ser Ala His Phe Ser Pro Pro Ser Ser Pro Ser Met 
                965                 970                 975 

Val Val His Arg Arg Gly Pro Pro Val Ala Thr Thr Pro Pro Leu Pro 
            980                 985                 990 

Pro His Leu Thr Ala Glu Glu Thr Pro Leu Phe Leu Ala Asp Ser Val 
        995                 1000                1005 

Ile Pro Lys Gly Leu Pro Pro Pro Leu Pro Gln Gln Gln Pro Gln Gln 
    1010                1015                1020 

Pro Pro Pro Gln Gln Pro Pro Gln Gln Pro Lys Ser Leu Met Asp Gln 
1025                1030                1035                1040 

Leu Gln Gly Val Val Thr Asn Phe Gly Ser Gly Ile Pro Asp Phe His 
                1045                1050                1055 

Ala Val Leu Ala Gly Pro Gly Thr Pro Gly Asn Ser Leu Arg Ser Leu 
            1060                1065                1070 

Tyr Pro Pro Pro Pro Pro Pro Gln His Leu Gln Met Leu Pro Leu His 
        1075                1080                1085 

Leu Ser Thr Phe Gln Glu Glu Ser Ile Ser Pro Pro Gly Glu Asp Ile 
    1090                1095                1100 

Asp Asp Asp Ser Glu Arg Phe Lys Leu Leu Gln Glu Phe Val Tyr Glu 
1105                1110                1115                1120 

Arg Glu Gly Asn Thr Glu Glu Asp Glu Leu Glu Glu Glu Glu Asp Leu 
                1125                1130                1135 

Pro Thr Ala Ser Lys Leu Thr Pro Glu Asp Ser Pro Ala Leu Thr Pro 
            1140                1145                1150 

Pro Ser Pro Phe Arg Asp Ser Val Ala Ser Gly Ser Ser Val Pro Ser 
        1155                1160                1165 

Ser Pro Val Ser Glu Ser Val Leu Cys Thr Pro Pro Asn Val Thr Tyr 
    1170                1175                1180 

Ala Ser Val Ile Leu Arg Asp Tyr Lys Gln Ser Ser Ser Thr Leu 
1185                1190                1195 

(2) INFORMATION FOR SEQ ID NO: 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 
(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: ZC775 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

GCTAGCATAA CCCCTTGGGG CCTCTAAACG GGTCT 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: ZC776 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
CTCAAGACCC GTTTAGAGGC CCCAAGGGGT TATGCTAGCT GCA 
(2) INFORMATION FOR SEQ ID NO: 5: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: ZC777 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
TGAGGGGTTT TTTGCTGAAA GGAGGAACTA TGCGGCCGCA 
(2) INFORMATION FOR SEQ ID NO:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: ZC778 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
AGCTTGCGGC CGCATAGTTC CTCCTTTCAG CAAAAAACCC 
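
The oligonucleotides of SEQ ID NOS:3-6 can be compared with one another computationally. As a minimal sketch, assuming the four sequences exactly as printed above, the code below takes the reverse complement of one oligonucleotide and finds the longest stretch it shares with another. The helper names are hypothetical and this is an illustration, not a description of how the oligonucleotides were designed or used.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


def longest_common_substring(s: str, t: str) -> str:
    """Longest contiguous stretch shared by s and t (dynamic programming)."""
    best_len, best_end = 0, 0
    prev = [0] * (len(t) + 1)
    for i in range(1, len(s) + 1):
        cur = [0] * (len(t) + 1)
        for j in range(1, len(t) + 1):
            if s[i - 1] == t[j - 1]:
                cur[j] = prev[j - 1] + 1
                if cur[j] > best_len:
                    best_len, best_end = cur[j], i
        prev = cur
    return s[best_end - best_len:best_end]


def duplex_length(oligo_a: str, oligo_b: str) -> int:
    """Length of the longest stretch of oligo_b complementary to oligo_a."""
    return len(longest_common_substring(reverse_complement(oligo_a), oligo_b))


ZC775 = "GCTAGCATAACCCCTTGGGGCCTCTAAACGGGTCT"            # SEQ ID NO:3
ZC776 = "CTCAAGACCCGTTTAGAGGCCCCAAGGGGTTATGCTAGCTGCA"    # SEQ ID NO:4
ZC777 = "TGAGGGGTTTTTTGCTGAAAGGAGGAACTATGCGGCCGCA"       # SEQ ID NO:5
ZC778 = "AGCTTGCGGCCGCATAGTTCCTCCTTTCAGCAAAAAACCC"       # SEQ ID NO:6

print(duplex_length(ZC775, ZC776))
print(duplex_length(ZC777, ZC778))
```

On the sequences as printed, ZC775 is complementary to ZC776 over its full 35 bases, and ZC777 and ZC778 share a 36-base complementary stretch, leaving short unpaired ends in each case. That pattern is consistent with oligonucleotides intended to be annealed in pairs, but the comparison alone does not establish it.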
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: ZC1751 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
AATTCTGTGC TCTGTCAAG 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
60 

CCCGGGCTCC CGGCAGTGCG AGCAGCTAAG GGCTGGCCGC CGCCTCCCTG AGCTCCCCCG 

120 

GMGCAGCCGA CCCCTGGTCG CGGCGTTCAC CTCGCCGATG CGCGGTTGGT AGGAGTGACC 

180 

GGAGCCATTC TCTCCTCGTT GATAAGATTC CCTACCAGGA TAGGAGCCTA TCTCCCTTTY 

240 

CACAGCAGGA CACAGAAATC TGGCCTTCAG TACTTTGGGA AAAGGATCTG AGACCTCCTG 

300 

GAGGTCTGAC CACTGGCTGT CATCTGTGGC TCTGGCCTGT GTGGGCCACT GAGCTCTACT 

360 

CAAACATTAA AGAGGAGGAG GGGAGATCTG TGGAATGGGC CACCCCGTTG GCCTGCTGCA 

420 

TTACTGAACC TGCGCTGTCC ACACGTGCCC AGATCATGGG ACCCAGGGCC TGCTAGGGCT 



10 15 



25 . v 30 



40 45 50 



55 60 65 



70 75. . 80 



85 . 90 95 

FIG. 8A 





475 


GGG 


AAG 


Gly 


Lys 




523 


CTC 


CTC 


Leu 


Leu 




20 




571 


AAG 


GGT 


% 


Gly 




619 


CTG 


GGA 


Leu 


Gly 




667 


TGC 


GGG 


Cys 


Gly 




715 


CTG 


TTT 


Leu 


Phe 




763 


ATC 


ACG 


He 


Thr 




100 

811 

TTG GGC GCC CGC ATT CTG GAC ACC TGC TCG AGG GAC ACC CAC GCC CTG 
Leu Gly Ala Arg He Leu Asp Thr Cys Ser Arg Asp Thr His Ala Leu 

105 110 115 

859 

GAG CAG TCA CTG ACC TTT GTG CGG GCG CTC ATC GAG AAG GAC GGC ACG 
Glu Gin Ser Leu Thr Phe Val Arg Ala Leu He Glu Lys Asp Gly Thr 
120 125 130 . 

907 

GAG GTC CGC TGC GGC AGG CGG GGC CCG CCC ATC ATC ACC AAG CCC GAA 
Glu Val Arg Cys Gly Arg Arg Gly Pro. Pro He He Thr Lys Pro Glu 
135 140 145 

955 

CGA GTG GTG GGT GTC AH GGA GCT TCG GGG AGC TCC GTC TCG ATC ATG 
Arg Val Val Gly Val. lie Gly Ala Ser Gly Ser Ser Val Ser He Met 
150 155 160 

1003 

GTG GCC AAC ATC CTC CGC CTC TTC AAG ATC CCT CAG ATC AGC TAT GCC 
Val Ala Asn He Leu Arg Leu Phe Lys He Pro Gin He Ser Tyr Ala 
165 170 175 180 

1051 

TCC. ACG GCC CCT GAC TTG AGT GAC AAC AGC CGC TAT GAC TTC TTC TCC 
Ser Thr Ala Pro Asp Leu Ser Asp Asn Ser Arg Tyr Asp Phe Phe Ser 

185 190 195 

1099 

CGG GTG. GTG CCC TCA GAC ACA TAC CAG GCC CAG GCC ATG GTG GAT ATT 
Arg Val Val Pro Ser Asp Thr Tyr Gin Ala Gin Ala Met Val Asp He 
200 205 210 

1147 

GTC CGA GCC CTC AAG TGG AAC TAT GTG TCC ACA CTG GCC TCA GAG GGC 
Val Arg Ala Leu Lys Trp Ash Tyr Val Ser Thr Leu Ala Ser Glu Gly 
215 . 220 225 

1195 

AGC TAC GGT GAG AGT GGT GTG GAG GCC TTT ATC CAG AAG TCC CGA GAG 
Ser Tyr Gly Glu Ser Gly Val GTu Ala Phe He Gin Lys Ser Arg Glu 
230 ... 235 240 

• 1243 

AAC GGA GGT GTG TGC ATT GCC CAG TCG GTG AAG ATT CCA CGG GAA CCC 
Asn Gly Gly Val Cys He Ala Gin Ser Val Lys Me Pro Arg Glu Pro 
245 . 250 255 260 

1291 

AAG ACG GGG GAG TTC GAC AAG ATC ATC AAA CGCCTA CTG GAA ACA TCC 
Lys Thr Gly Glu Phe Asp Lys He lie Lys Arg Leu Leu GTu Thr Ser 

265 270 ' 275 . 

FIG. 8B 
1339 
AAT GCC AG6 G6T ATC ATC ATC TTT GCC AAC GAG GAT GAC ATC AGG AGG 
Asn Ala Arg Gly lie He He Phe Ala Asn Glu Asp Asp lie Arg Arg 
280 285 290 

1387 

GTG TTG GAG GCA GCT CGC AGG GCC AAC CAG ACC GGC CAC TTC TTT TGG 
Val Leu Glu Ala Ala Arg Arg Ala Asn Gin Thr Gly His Phe Phe Trp 
295 300 305 

1435 

ATG GGT TCT GAT AGC TGG GGC TCC AAG AGT GCC CCT GTG CTG CGC CTT 
Met Gly Ser Asp Ser Trp Gly Ser Lys Ser Ala Pro Val Leu- Arg Leu 
. 310 315 320 

1483 

GAG GAG GTG GCC GAG GGC GCA GTC ACC ATT CTC CCC AAG AGG ATG TCT 
Glu Glu Val Ala Glu Gly Ala Val Thr He Leu Pro Lys Arg Met Ser 
325 330 335 . . 340 

1531 

GTT CGA GGG TTC GAC CGA TAC TTC TCC AGC CGC ACG CTG GAC AAC AAC 
Val Arg Gly Phe Asp Arg Tyr Phe Ser Ser Arg Thr Leu Asp Asn Asn 

345 350 355 . 

1579 

AGG CGC AAC ATC TGG TTT GCC GAG TTC TGG GAG GAC AAC TTC CAT TGC 
Arg Arg Asn He Trp Phe Ala Glu Phe Trp Glu Asp Asn Phe His Cys 
360 365 370 ' 

. 1627 

AAG TTG AGC CGC CAC GCG CTC AAG AAG GGA AGC CAC ATC AAG AAG TGC 
Lys Leu Ser Arg His Ala Leu Lys Lys Gly Ser Hi's He Lys Lys Cys 
375 380 385 

1675 

ACC AAC CGA GAG CGC ATC GGG CAG GAC KG GCC TAT GAG CAG GAG GGG 
Thr Asn Arg Glu Arg He Gly Gin Asp Ser Ala Tyr Glu Gin Glu Gly 
. 390 395 400 

1723 

AAG GTG CAG TTC GTG ATT GAC GCT GTG TAC GCC ATG GGC CAC GCG CTG 
Lys Val Gin Phe Val He Asp Ala Val Tyr Ala Met Gly His Ala Leu 
405 410 415 420 

1771 

CAC GCC ATG CAC CGT GAC CTG TGT CCC GGC CGC GTA GGA CTC TGC CCT 
His Ala Met His Arg Asp Leu Cys Pro Gly Arg Val Gly Leu Cys Pro 

425 430 435 

1819 

CGC ATG GAC CCC GTG GAT GGC ACC CAG CTG CTT AAG TAC ATC AGG AAC 
Arg Met Asp Pro Val Asp Gly Thr Gin Leu Leu Lys Tyr lie Arg Asn 
440 445 . 450 . 

FIG. 8C 
1867 

GTC AAC TTC TCA GGC ATT GCG GGG AAC CCT GTA ACC TTC AAT GAG AAC 

Val Asn Phe Ser Gly lie Ala Gly Asn Pro Val Thr Phe Asn Glu Asn 

455 •. 460 465 

. 1915 

GGA GAC GCA CCG. GGG CGC TAC GAC ATC TAC CAG TAC CAA CTG CGC AAT 

Gly Asp Ala Pro Gly Arg Tyr Asp He Tyr Gin Tyr Gin Leu Arg Asn 
470 475 480 

1963 

GGC TCG GCC GAG TAC AAG GTC ATC GGC TCG TGG ACA GAC CAC CTG CAC 

Gly Ser Ala Glu Tyr Lys Val Tie Gly Ser Trp Thr Asp His Leu His 

485 490 495 500 

2011 

CTC AGA ATA GAG CGG ATG CAG TGG CCA GGG AGT GGC CAG CAG CTG CCG 
Leu Arg He Glu Arg Met Gin Trp Pro Gly Ser Gly Gin Gin Leu Pro 

505 510 ' 515 

• 2059 
CGC TCC ATC TGC AGT CTG CCC TGC CAG CCC GGG GAG CGA AAG AAG ACT 
Arg Ser He Cys Ser Leu Pro Cys Gin Pro Gly Glu Arg Lys Lys Thr 
520 . 525 ■ 530 

2107 

. GTG AAG GGC ATG GCT TGC TGC TGG CAC TGC GAG CCC TGC ACC GGG TAC 
Val Lys Gly Met Ala Cys Cys Trp His Cys Glu Pro Cys Thr Gly Tyr 
535 . 540 545 

2155 

CAG TAC CAA GTG GAC CGC TAC ACC TGT AAG ACC TGC CCC TAC GAC ATG 
Gin Tyr Gin Val Asp Arg Tyr Thr Cys Lys Thr Cys Pro Tyr Asp Met 
550 : 555 560 

\ . 2203 
CGG CCC ACA GAG AAC CGC ACG AGC TGC CAG CCC ATC CCC ATC GTC AAG 
Arg Pro Thr Glu Asn Arc Thr Ser Cys Gin Pro He Pro lie Val Lys 
.565 570 575 580 

2251 

TTG GAG TGG GAC TCG CCG TGG GCC GTG CTG CCC CTC TTC CTG GCC GTG 
Leu Glu Trp Asp Ser Pro Trp Ala Val Leu Pro Leu Phe Leu Ala Val 

585 590 595 

2299 

GTG GGC ATC GCC GCC ACG CTG TTC GTG GTG GTC ACG TTT GTG CGC TAC 
Val Gly He Ala Ala Thr Leu Phe Val Val Val Thr Phe Val Arg Tyr 
600 ; 605 610 

2347 

AAC GAT ACC CCC ATC GTC AAG GCC TCG GGC CGG GAG CTG AGC TAC GTG 
Asn Asp Thr Pro He Val Lys Ala Ser Gly Arg Glu Leu Ser Tyr Val 
615 620 625 

FIG. 8D 
2395 
CTG CTG GCG GGC ATC TTT CTG TGC-TAC GCC ACT ACC TTC CTC ATG ATC 
Leu Leu Ala Gly He Phe Leu Cys Tyr Ala Thr Thr Phe Leu Met He 
630 635 640 

■ ' 2443 

GCA GAG GCG GAC GTG GGG ACC TGT TCG CTC CGC CGC ATC TTC CTA GGG 
Ala Glu Pro Asp Leu Gly Thr Cys Ser Leu Arg Arg He Phe Leu Gly 
645 . .: 650 655 660 

"'2491 

CTC GGC ATG AGC ATC AGC TAC GCG GCC CTG CTG ACC AAG ACC AAC CGC 
Leu Gly Met Ser lie Ser Tyr Ala Ala Leu Leu Thr Lys Thr Asn Arg 

665 670 675 

■: 2539 

ATT TAC CGC ATC TTT GAG CAG GGC AAA CGG -TCG GTC AGT GCC CCG CGT 
lie Tyr. Arg lie Phe Glu Gin Gly Lys Arg Ser VaV Ser Ala Pro Arg 
680 685 690 

: 2587 

TTC ATC AGC CCG GCC TCG CAG CTG GCC ATC ACC TTC ATC CTC ATC TCC 
Phe lie .Ser Pro Ala Ser Gin Leu Ala He Thr Phe lie Leu lie Ser 

695 700 . 705 .. 

2635 

CTG CAG CTG CTC GGC ATC TGC GTG TGG TTC GTG GTG GAC CCC TCC CAC 
Leu Gin Leu Leu Gly He Cys Val Trp Phe Val Val Asp Pro Ser His 
710 715 . 720 

2683 

TCG GTG GTG GAC TTC CAG GAC CAA CGG ACA CTT GAC CCC CGC TTT GCC 
Ser Val Val Asp Phe Gin Asp Gin Arg Thr Leu Asp Pro Arg Phe Ala 
725 730 735 740 

273.1 

AGG GGC GTG CTC AAG TGC GAC ATC TCG GAC CTG TCC CTC ATC TGC CTG 
Arg Gly Vail Leu Lys Cys Asp He Ser Asp Leu Ser Leu He Cys. Leu 

745 750 755 

! / 2779 

CTG GGC TAC AGC ATG CTG CTG ATG GTC ACG TGT ACT GTG TAC GCC ATC 

Leu Gly Tyif Ser Met Leu Leu Met Val Thr Cys Thr Val Tyr Ala He 

760 765 770 

■ \: ' ' . .. 2827 

AAG. ACC CGA GGC GTG CCC GAG ACC TTC AAC GAG GCC AAG CCC ATC GGC 

Lys Thr Argi Gly Val Pro Glu Thr Phe Asn Glu Ala Lys Pro lie Gly 

775i ' 780 785 

2875 

TTC ACC ATGi TAC ACC ACC TGC ATT GTC TGG CTG GCC TTC ATC CCC ATC 

Phe Thr Met; Tyr Thr Thr Cys lie Val Trp Leu Ala Phe He Pro He 

790 ! .. 795 800 

FIG. 8E 

2923 

TTT TTT GGC ACC TCA CA6 TCA GCC GAC. AAG CTG TAC ATC CAG ACA ACC 
Phe Phe Gly Thr Ser Gin Ser Ala Asp Lys Leu Tyr He Gin Thr Thr 
805 . 810 815 820 

2971 

ACA CTG ACG GTC TCC GTG AGT CTG AGC GCT TCA GTG TCC CTG GGG ATG 
Thr Leu Thr Val Ser Val Ser Leu Ser Ala Ser Val Ser Leu Gly Met 

825 830 835 

3019 

CTC TAC ATG CCC AAA GTC TAC ATC ATC.CTC TTC CAC CCG GAG CAG AAC 
Leu Tyr Met Pro Lys Val Tyr lie He Leu Phe His Pro Glu Gin Asn 
840 845 850 

■ . ' ■ 3067. 
GTG CCC AAG CGC AAG CGC AGT CTC AAA GCC GTG GTC ACC GCC GCC ACC 
Val Pro Lys Arg Lys Arg Ser Leu Lys Ala Val Val Thr Ala Ala Thr 
855 860 865 

. 3115 

ATG TCC AAC AAG TTC ACA CAG AAG GGC AAC TTC AGG CCC AAT GGG GAA 
Met Ser Asn Lys Phe Thr Gin Lys Gly Asn Phe Arg Pro Asn Gly Glu 
870 875 . 880 

3163 

GCC AAA TCA GAG CTG TGT GAG AAC CTG GAG ACC CCA GCG CTG GCT ACC 
Ala Lys Ser Glu Leu Cys Glu Asn Leu Glu Thr Pro Ala Leu Ala Thr 
885 890 . 895 900 

3209 

AAA CAG ACC TAC GTC ACC TAC ACC AAC CAT GCC ATC TAGCCGGGCC 
Lys Gin Thr Tyr Val Thr Tyr Thr Asn His Ala lie 

905 910 

3269 

GCGGAGCCAA GCAGGCTAAG GAGCCACAAC CTCTGAGGAT GGCACATTGG GCCAGGGCCG 

3329 

TTCCCGAGGG CCCTGCCGAT GTCTGCCCGC CTCCCGGGCA TCCACGAATG TGGCTTGGTG 

3389 

CTGAGGACAG TAGAGACCCC GGCCATCACT GCTGGGCAAG CCGTGGTGGG CAACCAGAGG 

. "'. • - " 3449 

AGGCCGAGTG GCTGGGGCAG TTCCAGGTTA TGCCACACAC AGGTCTTCCT TCTGGACCAC 

3509 

TGTTGGCCCA GCCCCAAAGC ACAGGGGCTC GGTCTCCAGA GCCCAGCCCT GGCTTCCTCT 

3569 

CCTTCCTCCT GCCTCCGTCT GTCCTGTGGG TGACCCCGGT TGGTCCCTGC CCCGTCTTTA 

3629 

CGTTTCTCTT CCGTCTTTGC TCTGCATGTG TTGTCTGTTT GGGCCCTCTG CTTCCATATT 

FIG. 8F 
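
The pairing of codons and three-letter residues on the sheets above follows the standard genetic code. As a minimal sketch, the code below translates the last coding line shown on this sheet (the twelve codons preceding the TAG stop) so that the printed residue line can be checked. The names `CODON_TABLE` and `translate` are hypothetical, and the input is transcribed from the sheet as printed.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

ONE_TO_THREE = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1 to three-letter residues, stopping at a stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(ONE_TO_THREE[amino_acid])
    return " ".join(residues)


# Terminal coding line of the sheet, spaces removed, including the stop codon.
TERMINAL = "AAA CAG ACC TAC GTC ACC TAC ACC AAC CAT GCC ATC TAG".replace(" ", "")
print(translate(TERMINAL))
```

A codon containing a character other than A, C, G, or T raises a `KeyError`, which makes the helper a quick way to flag transcription damage in a codon line.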
3689 

TTTCCATTCT GCTCCTGGCC TTCCCCTGCG ATCTGCCCTG CCCCCTGCCC CTCCTCCCTG 

3749 

AGCTGCCCCA TCCCCGCCAT CATTTTCTCT TCTGTTCCCC GTCGATCTCA TTTCCTACCA 

3809 

GCCTTCCCCC TACTTGGGTT CATCCACCAA CTCTTTCACC ACGTTGCAAA AGAGAAAAAA 

3869 

AAAGGGGGGG GGGAATCACC CCCTACAAAA AAGCGCAAAC AAAAACTAAT CTTGAGTGTG 

3929 

TTTCGAAGTG CTGCGTCCTC CTGGTGGCCT GTGTGTCCCT GTGCCTGCAG CCTGTCTGCC 

3989 

CGCCCTACCC GTCTGCCGTG TGTCCTGCCC CCCCCGCCTG GCCGCCTTGC CCTTCCTGCT 

4049 

AACGACACGG AGTTCAGTGC CTGGGTGTTT GGTGATGGTC TCTGATGTGT AGCATGTCTG 

4096 

TTTTTATACC GAGAACATTT CTAATAAAGA TAAACACATG GTTTTGC 

FIG. 8G 

60 

CCCAACATCA CGTTGGGCGC CCGCATTCTG . GACACCTGCT CGAGGGACAC CCACGCCCTG 

120 

GAGCAGTCAC TGACCTTTGT GCGGGCGCTC ATCGAGAAGG ACGGCACGGA GGTCCGCTGC 

180 

GGCAGGCGGG GCCCGCCCAT CATCACCAAG CCCGAACGAG TGGTGGGTGT CATTGGAGCT 

240 

TCGGGGAGCT CCGTCTCGAT CATGGTGGCC AACATCCTCC GCCTCTTCAA GATCCCTCAG 

300 

ATCAGCTATG CCTCCACGGC CCCTGACTTG AGTGACAACA GCCGCTATGA CTTCTTCTCC 

360 

CGGGTGGTGC CCTCAGACAC ATACCAGGCC CAGGCCATGG TGGATATTGT CCGAGCCCTC 

420 

AAGTGGAACT ATGTGTCCAC ACTGGCCTCA . GAGGGCAGCT ACGGTGAGAG TGGTGTGGAG 

480 

GCCTTTATCC AGAAGTCCCG AGAGAACGGA GGTGTGTGCA TTGCCCAGTC GGTGAAGATT 

540 

CCACGGGAAC CCAAGACGGG GGAGTTCGAC AAGATCATCA AACGCCTACT GGAAACATCC

600 

AATGCCAGGG GTATCATCAT CTTTGCCAAC GAGGATGACA TCAGGAGGGT GTTGGAGGCA 

660 

GCTCGCAGGG CCAACCAGAC CGGCCACTTC TTTTGGATGG GTTCTGATAG CTGGGGCTCC 

720 

AAGAGTGCCC CTGTGCTGCG CCTTGAGGAG GTGGCCGAGG GCGCAGTCAC CATTCTCCCC

780 

AAGAGGATGT CTGTTCGAGG GTTCGACCGA TACTTCTCCA GCCGCACGCT GGACAACAAC 

840 

AGGCGCAACA TCTGGTTTGC CGAGTTCTGG GAGGACAACT TCCATTGCAA GTTGAGCCGC 

900 

CACGCGCTCA AGAAGGGAAG CCACATCAAG AAGTGCAGCA ACCGAGAGCG CATCGGGCAG

960

GACTCGGCCT ATGAGCAGGA GGGGAAGGTG CAGTTCGTGA TTGACGCTGT GTACGCCATG

1020 

GGCCACGCGC TGCACGCCAT GCACCGTGAC CTGTGTCCCG GCCGCGTAGG ACTCTGCCCT 

1080 

CGCATGGACC CCGTGGATCG CACCCAGCTG CTTAAGTACA TCAGGAACGT CAACTTCTCA 

1140 

GGCATTGCGG GGAACCCTGT AACCTTCAAT GAGAACGGAG ACGCACCGGG GCGCTACGAC 



m sa 
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ATCTACCAGT AGCAACTGCG CAATGGCTCG GCCGAGTACA AGGTCATCGG CTCGTGGACA 



GACCACCTGC ACCTCAGAAT AGAGCGGATG CAGTGGCCAG GGAGTGGCCA GCAGCTGCCG 

1320

CGCTCCATCT GCAGTCTGCC CTGCCAGCCC GGGGAGCGAA AGAAGACTGT GAAGGGCATG

1380

GCTTGCTGCT GGCACTGCGA GCCCTGCACC GGGTACCAGT ACCAAGTGGA GCGCTACACC

1440

TGTAAGACCT GCCCCTACGA CATGCGGCCC ACAGAGAACC GCACGAGCTG CCAGCCCATG

1500

CCCATCGTCA AGTTGGAGTG GGACTCGCCG TGGGCCGTGC TGCCCCTCTT CCTGGCCGTG

1560

GTGGGCATCG CCGCCACGCT GTTCGTGGTG GTCACGTTTG TGCGCTACAA CGATACGCCC

1620

ATCGTCAAGG CCTCGGGGCG GGAGCTGAGC TACGTGCTGC TGGCGGGCAT CTTTCTGTGC

1680

TACGCCACTA CCTTCCTCAT GATCGCAGAG CCGGACCTGG GGAGCTGTTC GCTCCGCCGC

1740

ATCTTCCTAG GGCTCGGCAT GAGCATCAGC TACGCGGCCG TGCTGACCAA GACCAACCGC

1800

ATTTACCGCA TCTTTGAGCA GGGCAAACGG TCGGTCAGTG CCCCGGGTTT CATCAGCCCG

1860

GCCTCGCAGC TGGCCATCAC CTTCATCCTC ATCTCCCTGC AGCTGCTCGG CATCTGCGTG

1920

TGGTTCGTGG TGGACCCCTG CCACTCGGTG GTGGACTTCC AGGAGCAACG GACACTTGAC

1980

CCCCGCTTTG CCAGGGGCGT GCTCAAGTGC GACATCTGGG ACCTGTCCCT CATCTGCCTG

2040

CTGGGCTACA GCATGCTGCT GATGGTCACG TGTACTGTGT ACGCCATCAA GACCCGAGGC

2100

GTGCCCGAGA CCTTCAACGA GGCCAAGCCC ATGGGCTTCA CCATGTACAC CACCTGCATT

2160

GTCTGGCTGG CCTTCATCCC CATCTTTTTT GGCACCTCAC AGTCAGCCGA CAAGCTGTAC

2220 

ATCCAGACAA CCACACTGAC GGTCTCCGTG AGTCTGAGCG CTTCAGTGTC CCTGGGGATG 

2280 

CTCTACATGC CCAAAGTCTA CATCATCCTC TTCCATATTT TTCGATTCTG CTCCTGGCGT 
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32/32 '2340 

TCCCCTGCCA TCTGCCCTGC CCCCTGCCCC TCCTGCCTGA GCTGCCGCAT CCCCGCCATC

2400 

ATTTTCTCTT CTGTTCCCCC TCGATCTCAT TTCCTACCAG CCTTCCCCCT ACTTGGCTTC 

2426
CTCCACCAAC TCTTTCACCA CGTTGC 



na 9C
(vii) IMMEDIATE SOURCE:
(B) CLONE: ZC1752

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:
GATCCTTGAC AGAGCACAG 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(vii) IMMEDIATE SOURCE:
(B) CLONE: ZC2063

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:
GATCCAAACT AGTAAAAGAG CT 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(vii) IMMEDIATE SOURCE:

(B) CLONE: ZC2064

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:
CTTTTACTAG TTTG 
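
The declared lengths of SEQ ID NO: 9 (ZC2063, 22 bases) and SEQ ID NO: 10 (ZC2064, 14 bases) can be checked directly against the listed sequences, and the two oligonucleotides can be compared for complementarity. The following is a minimal sketch in Python; the variable and function names are illustrative and are not part of the listing, and the sequences are copied from the entries above with the spacing removed.

```python
# Sequences copied from SEQ ID NO: 9 and SEQ ID NO: 10 (grouping spaces removed).
ZC2063 = "GATCCAAACTAGTAAAAGAGCT"
ZC2064 = "CTTTTACTAGTTTG"

# Lengths as declared in the "(A) LENGTH" fields of the two entries.
DECLARED_LENGTHS = {"ZC2063": 22, "ZC2064": 14}

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


lengths_ok = (
    len(ZC2063) == DECLARED_LENGTHS["ZC2063"]
    and len(ZC2064) == DECLARED_LENGTHS["ZC2064"]
)

# 0-based offset in ZC2063 at which the reverse complement of ZC2064 occurs
# (-1 would mean the two oligonucleotides are not complementary).
offset = ZC2063.find(reverse_complement(ZC2064))

print(lengths_ok, offset)
```

Under these assumptions the check reports that both lengths agree with the listing and that ZC2064 is complementary to bases 5 through 18 of ZC2063, leaving four unpaired bases at each end of the longer strand.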

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 43 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(vii) IMMEDIATE SOURCE:

(B) CLONE: ZC2938

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:
GACAGAGCAC AGATTCACTA GTGAGCTCTT TTTTTTTTTT TTT 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(vii) IMMEDIATE SOURCE:
(B) CLONE: ZC3015



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
TTCCATGGCA CCGTCAAGGC T 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: ZC3016 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:
AGTGATGGCA TGGACTGTGG T 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(vii) IMMEDIATE SOURCE:
(B) CLONE: ZC3652

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:
ACATGCACCA TGCTCTGTGT
(2) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(vii) IMMEDIATE SOURCE:
(B) CLONE: ZC3654

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:
AGTGATGGCA TGGACTGTGG T

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5236 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: SN23 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 627..3344

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:
TAAGAATTTT ATAAATACTC TGGGAATTTT ATTGGTGATG CCTTTGTGTC TACAGGGCAC 
ACGTTCCAGA GAdCTCTGGT GTGAAGTGAT GGGGGACTTG TGGCTAGAGA AGCTTTTCAA 
TGGCCTTAAA CTCTGGGTCC TGCTTGAGAG AGGTCTGAGG TTCTCAACAT CAGAGCAGAG 
CTTCCACCAA GCTTTCAGAA TGCTAAGCCC CCACTTCTCA ACACTTAGTG CTCTGATCGG 
TGCCTGCGAA CCGAGAACGG CTGCAGTCCT CTGACCTGAG ACCAATAGCT GTGTCTACCC 
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180 
240 
300 
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GGXCTCAGCG TCCAGCTCAC CGCCACTAAC GCGCCGCGCA TTGGACACCT GATCCACACA 
CCTTCGGGCA CCAGTGAAAA ACCGCGACTT GATTTTCTGG AAGAACGCCC CCAGGGTGTG 
GGAGCGGTCG TGGAGGACCA GCAGGAGGAA GCGGAGGGGA GAGGGGCAGT AGTGGAGGCA 
GAGAAAGCGT TGAACCAGCT GTGTTGGCCG AAGGCACGAA ACGGCAAAAG GCAGCGGTGA 
GCATCTGTGT GGTTCCCGCT GGGAACCTGC - AGGCAGGACC GGCGTGGGAA CGTGGCTGGC 
CCGCGGTGGA CCCCGTCTTC GCCACA ATG GTC CGG CTC CTC TTG ATT TTC TTC
Met Val Arg Leu Leu Leu Ile Phe Phe
1 5

ccx ATG ATC TTT TTG GAG ATG TCC ATT TTG CCC AGG ATG CCT GAC AGA
Pro Met Ile Phe Leu Glu Met Ser Ile Leu Pro Arg Met Pro Asp Arg
10 15 20

AAA GTA TTG CTG GCA GGT GCC TCG TCC CAG CGC TCC GTG GCG AGA ATG
Lys Val Leu Leu Ala Gly Ala Ser Ser Gln Arg Ser Val Ala Arg Met

CAC GGA GAT GTC ATC ATC GGA GCC CTC TTC TCA GTC CAT CAC CAG CCT
Asp Gly Asp Val Ile Ile Gly Ala Leu Phe Ser Val His His Gln Pro
45 50 55

CCA GCC GAG AAG GTA CCC GAA AGG AAG TGT GGG GAG ATC AGG GAA CAG
Pro Ala Glu Lys Val Pro Glu Arg Lys Cys Gly Glu Ile Arg Glu Gln
60 65

TAT GGT ATC CAG AGG GTG GAG GCC ATG TTC CAC ACG TTG GAT AAG ATT
Tyr Gly Ile Gln Arg Val Glu Ala Met Phe His Thr Leu Asp Lys Ile
75 80 85

AAC GCG GAC CCG GTG CTC CTG CCC AAC ATC ACT CTG GGC ACT GAG ATC
Asn Ala Asp Pro Val Leu Leu Pro Asn Ile Thr Leu Gly Ser Glu Ile
90 95 100

CGG GAC TCC TGC TGG CAC TCT TCA GTG GCT CTC GAA CAG AGC ATC GAA
Arg Asp Ser Cys Trp His Ser Ser Val Ala Leu Glu Gln Ser Ile Glu
110 115

TTC ATC AGA GAC TCC CTG ATT TCC ATC CGA GAT GAG AAG GAT GGG CTG
Phe Ile Arg Asp Ser Leu Ile Ser Ile Arg Asp Glu Lys Asp Gly Leu
125 130

^ rr* Tee CTG CCT GAT GGC CAG ACC CTG CCC CCT GGC AGG ACT AAG 
£S £S Pro S£ oV ? ln Thr Pr= Pro Cly Ar, T»r Lys 

£ s s s s '« S S S5 w is i m - s 

155 160 

CAA GTC CAG AAT CTT CTC CAG CTG TTC GAC ATC CCA CAG ATC GCC TAT 
%n Sn Asn Leu Leu Gin Leu Phe Asp lie Pr Gin II Ala Tyr 
170 " 5 180 
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1229 



s s s H a' is s » s; $ s a ss ss i 

190 "=> 



aa'asss as s ss ss s £ s? g s s 

205 210 



■ Wr <rre TCA GCA GTC CAC ACA GAA 

ATA GTC AAG CGT TAC AAC TGG ACC-TAT GTC TCA Thr 6lu 

lie val Lys Arg Tyr Asn Trp Thr Tyr vai ser JJ() 



220 225 



S S 5! SS S S - * ? - i » = 2? S 

235 240 

« ere «c atc « .gc tcg «« m «. ™ g »* g 

sin f-lu Gly Leu Cys He Ala His ser asp jyj * / 2gg 

250 .255 

«* «« » a «= t£ j«ggg»!jiss & s; s 

Gly Glu Lys Ser Phe Asp Arg Leu ^ u ~| u * ' 280 

270 ^' 

• s {j : 'SS 28' S5 « a 5! ~ .s « « » i - * 

285 29U 

« ™ 2 s IS ■ S S S 25 - "5 S 52 SE J2 12 

Gly Leu Leu Ser Ala net Arg at? j . 310 



300 305 



CTC ATT GGA AGT GAT GGA TGG GCA GAC AGA GAT GAA GTC ATC GAA GGC 
Leu Ile Gly Ser Asp Gly Trp Ala Asp Arg Asp Glu Val Ile Glu Gly
315 320 

Tyr Glu V«l Slu Ma sl » G1 * "* ™ r ' ■ 343 



330 335 



W«j. =: gB «e «j« »• 2« j-; S-S5 g 

Glu Val Arg Ser Phe Asp Asp Tyr .ww j ,360 

350 3 " 



365 370 



380 385 



395 400 



1277 



1325 



1373 



1421 



1469 



1517 



1565 



1613 



1661 



1709 



1757 



1805 



1853 
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Met- Gly Phe Val He Asn Ala lie Tyr Ala Met Ala His Gly Leu Gin 
410 415 420 425 

AAC ATG CAC CAT GCT CTG TGT CCC GGC CAT GTG GGC CTG TGT GAT GCT 1949 
Asn Met His His Ala Leu Cys Pro Gly His Val Gly Leu Cys Asp Ala

430 435 440 
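
Each amino acid line in the coding region is the translation of the codon row printed above it, so a damaged amino acid line can be re-derived from its codons. The following is a minimal sketch in Python, assuming the standard genetic code; the names `CODON_TABLE`, `THREE_LETTER` and `translate_row` are illustrative and not part of the listing. The example row is the codon row ending at nucleotide 1949 of SEQ ID NO: 16.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (TTT, TTC, TTA, TTG, TCT, ...). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# One-letter to three-letter codes, as used in the sequence listing.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "***",
}


def translate_row(codon_row: str) -> str:
    """Translate a space-separated row of codons into three-letter codes."""
    codons = codon_row.upper().split()
    return " ".join(THREE_LETTER[CODON_TABLE[codon]] for codon in codons)


row = "AAC ATG CAC CAT GCT CTG TGT CCC GGC CAT GTG GGC CTG TGT GAT GCT"
translated = translate_row(row)
print(translated)
```

A codon containing a character other than A, C, G or T raises a `KeyError` in this sketch, which is a convenient way to flag rows where the nucleotide line itself is damaged.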

ATG AAA CCC ATT GAT GGC AGG AAG CTC CTG GAT TTC CTC ATC AAA TCC 1997
Met Lys Pro Ile Asp Gly Arg Lys Leu Leu Asp Phe Leu Ile Lys Ser
445 450 455

TCT TTT GTC GGA GTG TCT GGA GAG GAG GTG TGG TTC GAT GAG AAG GGG 2045
Ser Phe Val Gly Val Ser Gly Glu Glu Val Trp Phe Asp Glu Lys Gly
460 465 470

GAT GCT CCC GGA AGG TAT GAC ATT ATG AAT CTG CAG TAC ACA GAA GCT 2093
Asp Ala Pro Gly Arg Tyr Asp Ile Met Asn Leu Gln Tyr Thr Glu Ala
475 480 485

AAT CGC TAT GAC TAT GTC CAC GTG GGG ACC TGG CAT GAA GGA GTG CTG 2141
Asn Arg Tyr Asp Tyr Val His Val Gly Thr Trp His Glu Gly Val Leu
490 495 500 505

AAT ATT GAT GAT TAC AAA ATC CAG ATG AAC AAA AGC GGA ATG GTA CGA
Asn Ile Asp Asp Tyr Lys Ile Gln Met Asn Lys Ser Gly Met Val Arg
510 515 520

TCT GTG TGC AGT GAG CCT TGC TTA AAG GGT CAG ATT AAG GTC ATA CGG
Ser Val Cys Ser Glu Pro Cys Leu Lys Gly Gln Ile Lys Val Ile Arg
525 530 535

AAA GGA GAA GTG AGC TGC TGC TGG ATC TGC ACG GCC TGC AAA GAG AAT
Lys Gly Glu Val Ser Cys Cys Trp Ile Cys Thr Ala Cys Lys Glu Asn
540 545 550

GAG TTT GTG CAG GAC GAG TTC ACC TGC AGA GCC TGT GAC CTG GGG TGG
Glu Phe Val Gln Asp Glu Phe Thr Cys Arg Ala Cys Asp Leu Gly Trp
555 560 565

TGG CCC AAC GCA GAG CTC ACA GGC TGT GAG CCC ATT CCT GTC CGT TAT
Trp Pro Asn Ala Glu Leu Thr Gly Cys Glu Pro Ile Pro Val Arg Tyr
570 575 580 585

CTT GAG TGG AGT GAC ATA GAA TCT ATC ATA GCC ATC GCC TTT TCT TGC
Leu Glu Trp Ser Asp Ile Glu Ser Ile Ile Ala Ile Ala Phe Ser Cys
590

CTG GGC ATC CTC GTG ACG CTG TTT GTC ACC CTC ATC TTC GTT CTG TAC
Leu Gly Ile Leu Val Thr Leu Phe Val Thr Leu Ile Phe Val Leu Tyr
605 610 615

CGG GAC ACA CCC GTG GTC AAA TCC TCC AGT AGG GAG CTC TGC TAT ATC
Arg Asp Thr Pro Val Val Lys Ser Ser Ser Arg Glu Leu Cys Tyr Ile
620

ATT CTG GCT GGT ATT TTC CTC GGC TAT GTG TGC CCT TTC ACC CTC ATC
Ile Leu Ala Gly Ile Phe Leu Gly Tyr Val Cys Pro Phe Thr Leu Ile
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640 645 



SSSSSSSS3SS2S225S 

650 655 

CTC TCT TCT GCC ATG TGC TAC TCT GCT TTA GTG ACC AAA ACC AAT CGT 
Leu Ser Ser Ala Met Cys Tyr Ser Ala Leu Val Thr Lys Thr Asn Arg

670 675 

S 2 S S 2 S S 2 3S * 2 2.35 2 £5. 

685 690 



si £ s 2 is s s M 2 s a s 2 a? 

700 705 

715 ■ ■ J 720 2 

CCC ATG CCC ATT TTG TCC TAC CCG AGT ATC AAG GAA GTC TAC CTT ATC 
Pro Met Pro Ile Leu Ser Tyr Pro Ser Ile Lys Glu Val Tyr Leu Ile

730 735 

■sss s 2 as s s aas-2 »sj«s.s 

750 755 

CTC CTC ATC ATG AGC TGT ACC TAC TAT GCC TTC AAG ACC CGC AAC GTG 
Leu Leu Ile Met Ser Cys Thr Tyr Tyr Ala Phe Lys Thr Arg Asn Val
765 770 

555 2 ^ S 2 2 E 2 2 2.2 2 2 2i 25 S 

780 785 

ssssssssssssfj 2:2.225. 

795 * 800 

££22222 S 2 2! 2 2 25 2T 2 2? 
■S22S2:52222S22222 

830 

;252222222§22r5Sg*2 



845 



860 865 




2621 



2669 



2717 



2765 



2813 



2861 



2909 



2957 



3005 



3053 



3101 



3149 



3197 
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ArP nc CGG AG A AAG AAG CCC GGG GCA GGG AAT GCC AAG, AAG AGG 

Asn Ile Phe Arg Arg Lys Lys Pro Gly Ala Gly Asn Ala Lys Lys Arg

875 . 880 885 

as ess iss? k s js as si g.s?. : ss s ssss 

890 895 

CTT TGAAAACCCC CACACTGCAG TGAATGTTTC TAACGGCAAG TCTGTGTCAT 
Leu 



GGTCTGAACC AGGTGGAAGA 
TGCACGTGAA GACCAACGAG 
AAAGTTACCA AGGCTCTGGC 
ACAATGTGGA AGAAGAGGAC 
CTATGGTGGT GCACCGACGC 
TGACCGCAGA AGAGACCCCC 
CTCCTCTCCC GCAGCAGCAG 
AGTCCCTGAT GGACCAGCTG 
TCCATGCGGT GCTGGCAGGC 
CCCCGCCTCC GCCGCAACAC 
AGTCCATCTC CCCTCCTGGG 
AGGAGTTCGT GTACGAGCGC 
ACCTGCCCAC AGCCAGCAAG 
CTTTCCGAGA TTCCGTGGCC 
TCCTCTGCAC CCCXCGAAAT 
GCTCTTCCAC CCTGTAGTGT 
CCAGAGATGC CAAGGAGTGT 
GGGATGGAGG ACCACGGTCT 
AGGAGAGGGA CGATGCCAAC 
TCAAAAACCT TCTCTAGAAA 
AACTTTTATC GAAAAAAATA 

TTTGCTCAAT CGTGCATACA 

i 

CAGGCCCTGC GATGGTGCCA 



CAGGCGCCCA AGGGACAGCA 
ACGGCCTGTA ACCAAACAGC 
AAGAGCCTGA CCTTTTCAGA 
AATACCCCTT CTGCTCACTT 
GGGCCACCCG TGGCCACCAC 
CTGTTCCTGG CTGATTCCGT 
CCACAGCAGC CGCCCCCTCA 
CAAGGCGTAG TCACCAACTT 
CCGGGGACAC CAGGAAACAG 
CTGCAGATGC TGCCCCTGCA 
GAGGACATCG ATGATGACAG 
GAAGGGAACA CCGAAGAAGA 
CTGACCCCTG AGGATTCTCC 
TCTGGCAGCT CAGTGCCCAG 
GTAACCTACG CCTCTGTCAT 
GTGTGTGTGT GTGGGGGCGG 
CAACCCTTCC AGAAATGTGT 
GCAGGGAAGA aaaaaaaaaa 
TGAACAGTGG TCCTGGCCAG 
GAAAGGGAAT TATGACAAAG 
ATAAAACGTA AAAATAAAAT 
TATATCTGCC CACACTCCCG 
ACTGAATCCT AAGTTCATCA 



CGTGTGGCAG 



CGTAATCAAA 
TGCCAGCACC 
CAGCCCTCCC 
ACCACCTCTG 
CATCCCCAAG 
GCAGCCCCCG 
CGGTTCGGGG 
CCTGCGCTCT 
CCTGAGCACC 
TGAGAGATTC 
TGAATTGGAA 
TGCCCTGACG 
TTCCCCCGTA 
TCTGAGGGAC 
GGGGAGTGCG 
AGAAAGCAGG 
TGCTGCGGCT 
GATTGTGACT 
CACAATTCCA 
CAACAAAAAT 
TGGTAAAACT 
TCCTAGTGAG 



CGCCTCTCTG * 
CCCCTCACTA 
AAGACeCTTT 
AGCAGCCCTT 
CCACCCCATC 
GGCTTGCCTC 
CAGCAGCCCA 
ATTCCAGATT * 
CTGTACCCGC 
TTCCAGGAGG 
AAGCTCCTGC 
GAGGAGGAGG 
CCTCCTTCTC 
TCTGAGTCGG 
TACAAGCAAA 
CATGGAGAAG 
GTGAGGGATG 
GCCTTAAAGA 
CTTGAATTAT 
TATGGTATGT 
AATCTCTTCT 
AGAAGCGAAG 
CAGATGG AGA 



3293 
3341 
3394 

3454 

3514 

3574 

3634 

3694 

3754 

3814 

3874 

3934 

3994 

4054 

4114 

4174 

4234 

4294 

4354 

4414 

4474 

4534 

4594 

4654 

4714 

4774 
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GASGGCAGGA GGCGAGAGGG CAGGAGGCGG GGGTAGGTTC GGACAACAGC TCCCATCTCA 
GACCTTGACT GTGCTGAGTC TTCAGACTCC TGGACTAAGG AAGACCCGGG GACTGACCTT 
ATGAGGGTCC CTTTCCACTG CTGTGATCCA TTGCCAGCCT GTAGTCACCC GGGATAAAGG 
CACAGTAACC TTTTGCATTC CTGTGATTCC CTGTGTTTAA GGAAAAGGAA AGTATGAGCA 
AAGCTATCAC CAAAAAGAGC GCCATTAGAA GTTACGGGGG AGAAAAAAAG AGAAGCAAGA 
TGATATATAA GCACAGGGCC TTGAACAAGG TGAGCGTGCT TCACAGATTC CGTATTAATG 
TACAGATACT TTTGGAGAGG AGAAAGATAA CAAGGAGTGT CAGGCCGTTT GTGAACTCAC 
TTGCACTGTG CCAACCAGGT TCTCCGCTGC CCTTCAGCAA AA 

(2) INFORMATION FOR SEQ ID NO: 17:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 906 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
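
The protein length stated here can be cross-checked against the CDS feature of the corresponding cDNA entry. The following is a minimal sketch in Python, assuming that a CDS location `start..end` is 1-based and inclusive and that it covers only the codons of the listed protein (no stop codon); the function name `cds_codons` is illustrative and not part of the listing. The coordinates are those given for SEQ ID NO: 16 (627..3344, encoding the 906 amino acids of SEQ ID NO: 17) and SEQ ID NO: 18 (463..3198, encoding the 912 amino acids of SEQ ID NO: 19).

```python
def cds_codons(start: int, end: int) -> int:
    """Number of codons in a 1-based, inclusive CDS location start..end."""
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError(f"CDS length {length} is not a multiple of 3")
    return length // 3


# (CDS start, CDS end, declared protein length) taken from the listing.
ENTRIES = {
    "SEQ ID NO: 16 / 17": (627, 3344, 906),
    "SEQ ID NO: 18 / 19": (463, 3198, 912),
}

results = {
    name: cds_codons(start, end) == declared
    for name, (start, end, declared) in ENTRIES.items()
}
print(results)
```

Both entries are consistent under this assumption: 2718 and 2736 coding nucleotides correspond to 906 and 912 codons respectively.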

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
Met Val Arg Leu Leu Leu Ile Phe Phe Pro Met Ile Phe Leu Glu Met
10

Ser Ile Leu Pro Arg Met Pro Asp Arg Lys Val Leu Leu Ala Gly Ala
20 25

Ser Ser Gln Arg Ser Val Ala Arg Met Asp Gly Asp Val Ile Ile Gly
35 40

Ala Leu Phe Ser Val His His Gln Pro Pro Ala Glu Lys Val Pro Glu
50 55

Arg Lys Cys Gly Glu Ile Arg Glu Gln Tyr Gly Ile Gln Arg Val Glu

Ala Met Phe His Thr Leu Asp Lys Ile Asn Ala Asp Pro Val Leu Leu

Pro Asn Ile Thr Leu Gly Ser Glu Ile Arg Asp Ser Cys Trp His Ser
100 105

Ser Val Ala Leu Glu Gln Ser Ile Glu Phe Ile Arg Asp Ser Leu Ile
115 120

ST U. AT, MP Glu Ly* A» GIV «9 W «» W. 

130 135 



Glh Thr U» Pro Pro. ? ly «, Thr Ly. W . go II U. Gly «1> II. 

145 150 
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Gly-Pro Gly Ser Ser Ser Val Ala lie Gin Val Gin Asn Leu. Leu Gin 

165 170 1'5 

Leu Phe Asp Ile Pro Gln Ile Ala Tyr Ser Ala Thr Ser Ile Asp Leu
180 185 190

Ser Asp Lys Thr Leu Tyr Lys Tyr Phe Leu Arg Val Val Pro Ser Asp
195 200 205

Thr Leu Gln Ala Arg Ala Met Leu Asp Ile Val Lys Arg Tyr Asn Trp
210 215 220

Thr Tyr Val Ser Ala Val His Thr Glu Gly Asn Tyr Gly Glu Ser Gly
225 230 235 240

Met Asp Ala Phe Lys Glu Leu Ala Ala Gln Glu Gly Leu Cys Ile Ala
245 250 255

His Ser Asp Lys Ile Tyr Ser Asn Ala Gly Glu Lys Ser Phe Asp Arg
260 265 270

Leu Leu Arg Lys Leu Arg Glu Arg Leu Pro Lys Ala Arg Val Val Val
275 280 285

Cys Phe Cys Glu Gly Met Thr Val Arg Gly Leu Leu Ser Ala Met Arg
290 295 300

Arg Leu Gly Val Val Gly Glu Phe Ser Leu Ile Gly Ser Asp Gly Trp
305 310 315 320

Ala Asp Arg Asp Glu Val Ile Glu Gly Tyr Glu Val Glu Ala Asn Gly
Gly Ile Thr Ile Lys Leu Gln Ser Pro Glu Val Arg Ser Phe Asp Asp
340 345 350

Tyr Phe Leu Lys Leu Arg Leu Asp Thr Asn Thr Arg Asn Pro Trp Phe
355 360 365

Pro Glu Phe Trp Gln His Arg Phe Gln Cys Arg Leu Pro Gly His Leu
370 375 380

Leu Glu Asn Pro Asn Phe Lys Lys Val Cys Thr Gly Asn Glu Ser Leu
385 390 395 400

Glu Glu Asn Tyr Val Gln Asp Ser Lys Met Gly Phe Val Ile Asn Ala
405 410 415

Ile Tyr Ala Met Ala His Gly Leu Gln Asn Met His His Ala Leu Cys
420 425 430

Pro Gly His Val Gly Leu Cys Asp Ala Met Lys Pro Ile Asp Gly Arg
435 440 445

Lys Leu Leu Asp Phe Leu Ile Lys Ser Ser Phe Val Gly Val Ser Gly
450 455 460
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Glu Glu val Trp Phe Asp Glu Lys Gly Asp Ala Pro Gly Arg Tyr Asp 
465 470 4 

lie Met Asn Leu Gin Tyr Thr Glu Ala Asn Arg Tyr Asp Tyr Val His 

485 4 * u 
val Gly Thr Trp His Glu Gly Val Leu Asn lie Asp Asp Tyr Lys lie 



500 



Gin Met Asn Lys Ser Gly Met Val Arg Ser Val Cys Ser Glu Pro Cys 

Leu Lys Gly Gin. lie Lys Val lie Arg Lys Gly Glu Val Ser Cys Cys 

530 • , 535 54 

Trp lie cys Thr Ala Cys Lys Glu Asn Glu Phe Val Gin Asp Glu Phe 
545 550 595 

Thr cys Arg Ala Cys Asp Leu Gly Trp Trp Pro Asn Ala Glu Leu Thr 



565 



Gly Cys Glu Pro lie Pro Val Arg Tyr Leu Glu Trp ser Asp lie Glu 

580 585 
ser lie lie Ala lie Ala Phe Ser Cys Leu Gly He Leu Val Thr Leu 

595 600 
Phe val Thr Leu lie Phe Val Leu Tyr Arg Asp Thr Pro Val Val Lys 

610 615 
ser ser Ser Arg Glu Leu Cys Tyr lie lie Leu Ala Gly lie Phe Leu 
625 630 

Gly Tyr Val cys Pro Phe Thr Leu lie Ala Lys Pro Thr Thr Thr Ser 
cys Tyr Leu Gin Arg Leu Leu Val Gly Leu Ser Ser Ala Met cys Tyr 

660 " 5 

ser Ala Leu Val Thr Lys Thr Asn Arg lie Ala Arg lie Leu Ala Gly 

675 J- 680 

ser Lys Lys Lys lie Cys Thr Arg Lys Pro Arg Phe Met Ser Ala Trp 

690 6 " 

«. Gin v.1 II. II. AU s.r U. L» II. s« V.1 sin «« Thr Uu 

705 710 

V.1 VI Thr L.U II. II. M« Gl" Pr Pro M.t Pro II. t*u S.r Tyr 

725 730 

Pre ser lie Lys Glu Val Tyr Leu lie Cys Asn Thr S r Asn Leu Gly 

740 745 

v.1 V.1 XI. Pr V.1 Gly Tyr «n Gly L.U I*> H.t S«r Cy. Thr 

755 760 
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Tvr Tyr Ala Phe Lys Thr Arg Asn Val Pro Ala Asn Phe Asn Glu. Ala 
770 775 780 

Lys Tyr He Ala Phe Thr Met Tyr Thr Thr Cys He lie Trp Leu Ala 
785 790 . 795 800 

Phe Val Pro lie Tyr Phe Gly Ser Asn Tyr Lys lie lie Thr Thr Cys 

805 810 815 

Phe Ala val Ser Leu Ser Val Thr Val Ala Leu Gly Cys Met Phe Thr 
820 825 8 

Pro Lys Met Tyr He He He Ala Lys Pro Glu Arg * Asn Val Arg Ser 
835 840 845 

Ala Phe Thr Thr Ser Asp Val val Arg Met His val Gly Asp Gly Lys 
850 855 860 

Lku Pro cys Arg Ser Asn Thr Phe Leu Asn He Phe Arg Arg Lys Lys. 
.865 870 8 ?5 880 

Pro Gly Ala Gly Asn Ala Lys Lys Arg Gin Pro Glu Phe Ser Pro Ser 

885 890 

Ser Gin Cys Pro Ser Ala His Ala Gin Leu 
900 905 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4095 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: SN30 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 463..3198

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:
CCCGGGCTCC CGGCAGTGCG AGCAGCTAAG GGCTGGCCGC CGCCTCCCTG AGCTCCCCCG 
GAGCAGCCGA CCCCTGGTCG CGGCGTTCAC CTCGCCGATG CGCGGTTGGT AGGAGTGACC 
GGAGCCATTC TCTCCTCGTT GATAAGATTC CCTACCAGGA TAGGAGCCTA TCTCCCTTTC 
ACAGCAGGAC ACAGAAATCT GGCCTTCAGT ACTTTGGGAA AAGGATCTGA GACCTCCTGG 
AGCTCTGACC ACTGGCTGTC ATCTGTGGCT CTGGCCTGTG TGGGCCACTG AGCTCTACTC 
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AAACATTAAA GAGGAGGAGG GGACATCTGT CGAATCGGCC ACCCCCTTCG CCTGCTCCAT 
TACTGAACCT GCGCTGTCCA CACGTGCCCA GATCATGGGA CCCAGGGCCT GCTAGGGCIA 
GGAGCGGGGC CCACTATTCA TGGGTCTCTA GGCCTTTCCG AA ATG TCC GGG AAG 

l 

GGA GGC TGG GCC TGG TGG TGG GCC CGG CTG CCC CTC TGC CTA CTC CTC
Gly Gly Trp Ala Trp Trp Trp Ala Arg Leu Pro Leu Cys Leu Leu Leu
5 10 15 20



mm rvr PCT TCA TCC TTG GGA AAG CCC AAG GGT 

S-2 ™ S S £ & » S If «» G!y «. ». ty. Gly 

25 30 
CAC CCC CAC ATG AAC TCT ATC CGA ATT CAC GGG GAC ATC ACA CTG GGA 
His Pro His Met Asn Ser Ile Arg Ile Asp Gly Asp Ile Thr Leu Gly

*" 40 45 



£ y S S5 & 2S G15 s ss ss.ss.ig s sss 



55 60 



„„„ ... ~e ATC cAC CGC CTG GAG GCC ATG .CTG TTT 

GAG CTG AAG AAG GAG AAA GGC ATC. CAC u*. ^ p ^ 

Glu Leu Lys Lys Glu Lys Gly lie His Arg **u 



70 75 



GCC CTG GAC CGC ATC AAC AAT GAC CCG GAC CTA CTG CCC AAC ATC ACG 
Ala Leu Asp Arg Ile Asn Asn Asp Pro Asp Leu Leu Pro Asn Ile Thr

85 90 95 100



1 .«m ^iv- r&r ICC TGC TCG AGG GAC ACC CAC GCC CTG ; 

cyl I« at, asp ^ g «. 

105 110 

GAG CAG TCA CTG ACC TTT GTG CGG «G CTC ATC GAG AAG GAC GGC ACG 

Glu Gin ser Leu Thr Phe Val Arg Ala Leu f±« * , 130 
• 120' r 125 ■. 

s 'a £ w aafs s.S s S - ~ - 

135 140 



150 155 



GTG GCC AAC ATC CTC CGC CTC TTC AAG ATC CCT CAG ATC AGO TAT GCC 
Val Ala Asn He Leu Arg Leu Phe Lys lie Pro oin j. iso 
165 i 170 

s s s ss » = S 25 55 - "'S 8 

185 A 
CGG GTG GTG CCC TCA GAC ACA TAC CAG GCC CAG GCC ATG GTG GAT ATT 

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Aw Val Val Pro Set Asp Thr Tyr Gin Ala Gin Ala Met Val Asp lie 

200 205 



3? g'SS £ S JSS $ S SS2 K.S2 SiS SS 

215 220 

&SC TAC «GT GAG ACT GGT GTG GAG GCC TTT ATC CAG AAG TCC CGA GAG 
Ser ™r "SlJ ciu Ser Gly Val Glu Ala Phe lie Gin Lys Ser Arg Glu 

230 235 

SSS S S5 2 S S2 S! IS. K S S 3! » S 

245 250 255 

aM'B^'s'sg - & 2 ' Si r r § r 

**m irr rrT ATC ATC ATC TTT GCC AAC GAG GAT GAC ATC AGG AGG 

j£. jg SS S? S S "J AS„ Glu Up «P U. Ar, AT, 

280 - 2 ^5 

S2S2 SSS K2S ^ £ iS.SS 2 5 

295 300 

'sasss »aa a as s gas 52" 

310 315 

a a a a as "s a:s-s s .a a « a « a 

325 330 339 

„, ttc GAC CGA TAC TTC TCC AGC CGC ACG CTG GAC AAC AAC 

val Sg 2£ «£ JJR.S £r Phe Ser ser Arg Thr Leu Asp Asn Asn 

rrr i&e ATC TGG TTT GCC GAG TTC TGG GAG GAC AAC TTC CAT TGC 
£g Sg £n He f£.£! £a Glu Phe Trp Glu Asp Asn Phe His Cys 

360 3 " 



a a a a s.s a a a a s 5 a a - 

390 395 
AAG GTG CAG TTC GTG ATT GAC GCT GTG TAC GCC ATG GGC CAC GCG CTG
Lys Val Gln Phe Val Ile Asp Ala Val Tyr Ala Met Gly His Ala Leu

CAC GCC ATG CAC CGT GAC CTG TGT CCC GGC CGC GTA GGA CTC TGC CCT
His Ala Met His Arg Asp Leu Cys Pro Gly Arg Val Gly Leu Cys Pro
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94 

435 



r ._ sec » cc cag CTG CTT AAG TAC ATC AGG AAC 
% Z g 55 £ £ S S S UU L». m U. *, M. 

_ *nn rrr GTA ACC TTC AAT GAG AAC 

SS S 5 S2-S S S S S ■ S S > «--» 

455 460 

sssssssssss£25™ 

470 475 

TCG GCC GAG TAC AAG GT= ATC Og.»'«« *g « g5 £. gS 

Gly Ser Ala Glu Tyr Lys Val He Gly ser Trp in v 5QQ 
485 490 

Leu Arg He Glu Arg Met Gin Trp Pro Gly ser u y ^ 

505 3 ' 

s = s 5 sis s f ; s'-sa's' « a 

5 20 525 

ws'-b-s s s-sas as i sa?.s 

535 540 



._ _ . 



550 555 



CCG CCC ACA GAS AAC CGC ACG £.£ CAC CCC ATC CCC MC GTC JAG 
Arg Pro Thr Glu Asn Arg Thr Ser Cys Gin pro xxe sgo 



565 570 



« GAG TGG GAC TCG COG TGQ GCC GTG CTG CCC CTC TTC QTG GCC GTG 
Leu Glu Trp Asp Ser Pro Trp Ala vai tju rtu 595 

585 



s s s-s ssasa s « s-s g« 

600 .605 



~~ ^ rrr TCG GGC CGG GAG CTG AGC TAC GTG 

£*.£ S;S.2 S -» a - w .~ 

615 620 

s. sis. £ S 5 = S S 2. S S S =. S a 



630 635 



sssasgsa==3™ =s S 
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CTC GGC ATG AGC ATC AGC TAC GCG GCC CTG CTG ACC AAG ACC AAC CGC 2490 
Leu Glv Met Ser lie Ser Tyr Ala Ala Leu. Leu Thr Lys Thr Asn Arg 

665 .670 675 

ATT TAC CGC ATC TTT GAG CAG GGC AAA CGG TCG GTC AGT GCC CCG CGT 2538 
lie Tyr Arg lie Phe Glu Gin Gly Lys Arg Ser Val Ser Ala Pro Arg 
680. 685 690 

TTC ATC AGC CCG GCC TCG CAG CTG GCC ATC ACC TTC ATC CTC ATC TCC 2586 
Phe lie Ser Pro Ala Ser Gin Leu Ala He Thr Phe He Leu lie Ser 



695 



700 705 



CTG CAG CTG CTC GGC ATC TGC GTG TGG TTC GTG GTG GAC CCC TCC CAC 2634 
Leu Gin Leu Leu Gly He Cys Val Trp Phe Val Val Asp Pro Ser His 
710 715 720 . 

TCG GTG GTG GAC TTC CAG GAC CAA CGG ACA CTT GAC CCC CGC TTT GCC 2682 
Ser Val Val Asp Phe Gln Asp Gln Arg Thr Leu Asp Pro Arg Phe Ala
725 730 735 740 

AGG GGC GTG CTC AAG TGC GAC ATC TCG GAC CTG TCC CTC ATC TGC CTG 2730 
Arg Gly Val Leu Lys Cys Asp He Ser Asp Leu Ser Leu He Cys Leu 

745 750 755 

CTG GGC TAC AGC ATG CTG CTG ATG GTC ACG TGT ACT GTG TAC GCC ATC 2778 
Leu Gly Tyr ser Met Leu Leu Met Val Thr cys Thr Val Tyr Ala He 
760 765 770 

AAG ACC CGA GGC GTG CCC GAG ACC TTC AAC GAG GCC AAG CCC ATC GGC 2826 
Lys Thr Arg Gly Val Pro Glu Thr Phe Asn Glu Ala Lys Pro He Gly 
775 780 785 . 

TTC ACC ATG TAC ACC ACC TGC ATT GTC TGG CTG GCC TTC ATC CCC ATC 2874 
Phe Thr Met Tyr Thr Thr Cys lie Val Trp Leu Ala Phe He Pro He 
790 .795 800 

TTT TTT GGC ACC TCA CAG TCA GCC GAC AAG CTG TAC ATC CAG ACA ACC 2922 
Phe Phe Gly Thr Ser Gin ser Ala Asp Lys Leu Tyr He Gin Thr Thr 
80S 810 815 820 

ACA CTG ACG GTC TCC GTG AGT CTG AGC GCT TCA GTG TCC CTG GGG ATG 2970
Thr Leu Thr Val Ser Val Ser Leu Ser Ala Ser Val Ser Leu Gly Met
825 830 835

CTC TAC ATG CCC AAA GTC TAC ATC ATC CTC TTC CAC CCG GAG CAG AAC 3018
Leu Tyr Met Pro Lys Val Tyr Ile Ile Leu Phe His Pro Glu Gln Asn
840 845 850

GTG CCC AAG CGC AAG CGC AGT CTC AAA GCC GTG GTC ACC GCC GCC ACC 3066
Val Pro Lys Arg Lys Arg Ser Leu Lys Ala Val Val Thr Ala Ala Thr
855 860 865

ATG TCC AAC AAG TTC ACA CAG AAG GGC AAC TTC AGG CCC AAT GGG GAA 3114
Met Ser Asn Lys Phe Thr Gln Lys Gly Asn Phe Arg Pro Asn Gly Glu
870 875 880

GCC AAA TCA GAG CTG TGT GAG AAC CTG GAG ACC CCA GCG CTG GCT ACC ^ 3162 
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Al^Lys ser Glu Leu Cys Glu Asn Leu Glu Thr Pro Ala Leu Ala Thr 

' ' ' 890 ' < ovo 



AAA CAG ACC TAC GTC ACC TAC ACC AAC CAT GCC ATC TAGCCGGGCC 3208
Lys Gln Thr Tyr Val Thr Tyr Thr Asn His Ala Ile
905 910

GCGGAGCCAA GCAGGCTAAG GAGCCACAAC CTCTGAGGAT GGCACATTGG GCCAGGGCCG 3268
TTCCCGAGGG CCCTGCCGAT GTCTGCCCGC CTCCCGGGCA TCCACGAATG TGGCTTGGTG 3328
CTGAGGACAG TAGAGACCCC GGCCATCACT GCTGGGCAAG CCGTGGTGGG CAACCAGAGG 3388
AGGCCGAGTG GCTGGGGCAG TTCCAGGTTA TGCCACACAC AGGTCTTCCT TCTGGACCAC 3448
TGTTGGCCCA GCCCCAAAGC ACAGGGGCTC GGTCTCCAGA GCCCAGCCCT GGCTTCCTCT 3508
CCTTCCTCCT GCCTCCGTCT GTCCTGTGGG TGACCCCGGT TGGTCCCTGC CCCGTCTTTA 3568
CGTTTCTCTT CCGTCTTTGC TCTGCATGTG TTGTCTGTTT GGGCCCTCTG CTTCCATATT 3628
TTTCCATTCT GGTCCTGGCC TTCCCCTGCC ATCTGCCCTG CCCCCTGCCC CTCCTCCCTG 3688
AGCTGCCCCA TCCCCGCCAT CATTTTCTCT TCTGTTCCCC CTCGATCTCA TTTCCTACCA 3748
GCCTTCCCCC TACTTGGCTT CATCCACCAA CTCTTTCACC ACGTTGCAAA AGAGAAAAAA 3808
AAAGGGGGGG GGGAATCACC CCCTACAAAA AAGCCCAAAC AAAAACTAAT CTTGAGTGTG 3868
TTTCGAAGTG CTGCGTCCTC CTGGTGGCCT GTGTGTCCCT GTGCCTGCAG CCTGTCTGCC 3928
CGCCCTACCC GTCTGCCGTG TGTCCTGCCC CCCCCGCCTG CCCGCCTTGC CCTTCCTGCT 3988
AACGACACGG AGTTCAGTGC CTGGGTGTTT GGTGATGGTC TCTGATGTGT AGCATGTCTG 4048
TTTTTATACC GAGAACATTT CTAATAAAGA TAAACACATG GTTTTGC 4095

(2) INFORMATION FOR SEQ ID NO: 19:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 912 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:

Met Ser Gly Lys Gly Gly Trp Ala Trp Trp Trp Ala Arg Leu Pro Leu



cyeleu i*» Leu S« Leu Tyr Ala ^ Trp V.1 Pro s Gly 
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lie Thr Leu Gly Gly Leu Phe Pro val His Gly Arg Gly ser Glu Gly 

'55 60 



50 



Lys 

65 



Ala cys Gly Glu Leu Lys Lys Glu Lys Gly He His Arg Leu Glu 



70 



75 



Ala Met Leu Phe Ala Leu Asp Arg Ile Asn Asn Asp Pro Asp Leu Leu

85 90 95 

Pro Asn He Thr Leu Gly Ala Arg lie Leu Asp Thr Cys Ser Arg Asp 
100 105 11 

Thr His Ala Leu Glu Gin Ser Leu Thr Phe Val Arg Ala Leu lie Glu 
115 I 20 125 

Lys Asp Gly Thr Glu Val Arg Cys Gly Arg Arg Gly Pro Pro lie lie 
130 135 140 

Thr Lys Pro Glu Arg Val Val Gly Val Ile Gly Ala Ser Gly Ser Ser



145 



150 



155 



Val ser He Met Val Ala Asn lie Leu Arg Leu Phe Lys lie Pro Gin 

165 170 1"5 

He Ser Tyr Ala ser Thr Ala Pro Asp Leu Ser Asp Asn Ser Arg Tyr 



180 



185 



Asp Phe Phe Ser Arg Val Val Pro Ser Asp Thr Tyr Gin Ala Gin Ala 
195 r 200 205 

Met Val Asp He Val Arg Ala Leu Lys Trp Asn Tyr Val Ser Thr Leu 
210 215 220 

Ala Ser Glu Gly Ser Tyr Gly Glu Ser Gly Val Glu Ala Phe Ile Gln



225 



Lys 



230 



235 



Ser Arg Glu Asn Gly Gly Val Cys lie Ala Gin Ser Val Lys lie 



245 



250 



Pro Arg Glu Pto Lys Thr Gly Glu Phe Asp Lys He He Lys Arg Leu 



260 



265 



Leu Glu Thr Ser Asn Ala Arg Gly Ile Ile Ile Phe Ala Asn Glu Asp

275 280 285

Asp Ile Arg Arg Val Leu Glu Ala Ala Arg Arg Ala Asn Gln Thr Gly
290 295 300 

His Phe Phe Trp Met Gly Ser Asp Ser Trp Gly Ser Lys Ser Ala Pro 



315 



305 ; 310 

Val Leu Arg Leu Glu Glu Val Ala Glu Gly Ala Val Thr Ile Leu Pro



325 



330 



Lys 



Arg Ket Ser Val Arg Gly Phe Asp Arg Tyr Phe Ser Ser Arg Thr 



340 



345 
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Leu Asp Asn Asn Arg Arg Asn lie Trp Phe Ala Glu Phe Trp Glu Asp 
355 . 360 365 

Asn Phe His cys Lys Leu Ser Arg. His Ala Leu Lys Lys Gly Ser His 
370 375 38.0 

Ile Lys Lys Cys Thr Asn Arg Glu Arg Ile Gly Gln Asp Ser Ala Tyr
385 390 395 400

Glu Gln Glu Gly Lys Val Gln Phe Val Ile Asp Ala Val Tyr Ala Met
405 410 415

Gly His Ala Leu His Ala Met His Arg Asp Leu Cys Pro Gly Arg Val
420 425 430

Gly Leu Cys Pro Arg Met Asp Pro Val Asp Gly Thr Gln Leu Leu Lys
435 440 445

Tyr Ile Arg Asn Val Asn Phe Ser Gly Ile Ala Gly Asn Pro Val Thr
450 455 460

Phe Asn Glu Asn Gly Asp Ala Pro Gly Arg Tyr Asp Ile Tyr Gln Tyr
465 470 475 480

Gln Leu Arg Asn Gly Ser Ala Glu Tyr Lys Val Ile Gly Ser Trp Thr
485 490 495

Asp His Leu His Leu Arg Ile Glu Arg Met Gln Trp Pro Gly Ser Gly
500 505 510

Gln Gln Leu Pro Arg Ser Ile Cys Ser Leu Pro Cys Gln Pro Gly Glu
515 520 525

Arg Lys Lys Thr Val Lys Gly Met Ala Cys Cys Trp His Cys Glu Pro
530 535 540

Cys Thr Gly Tyr Gln Tyr Gln Val Asp Arg Tyr Thr Cys Lys Thr Cys
545 550 555 560

Pro Tyr Asp Met Arg Pro Thr Glu Asn Arg Thr Ser Cys Gln Pro Ile
565 570 575

Pro Ile Val Lys Leu Glu Trp Asp Ser Pro Trp Ala Val Leu Pro Leu
580 585 590

Phe Leu Ala Val Val Gly Ile Ala Ala Thr Leu Phe Val Val Val Thr
595 600 605

Phe Val Arg Tyr Asn Asp Thr Pro Ile Val Lys Ala Ser Gly Arg Glu
610 615 620

Leu Ser Tyr Val Leu Leu Ala Gly Ile Phe Leu Cys Tyr Ala Thr Thr
625 630 635 640

Phe Leu Met Ile Ala Glu Pro Asp Leu Gly Thr Cys Ser Leu Arg Arg
645 650 655

Ile Phe Leu Gly Leu Gly Met Ser Ile Ser Tyr Ala Ala Leu Leu Thr
660 665 670

Lys Thr Asn Arg Ile Tyr Arg Ile Phe Glu Gln Gly Lys Arg Ser Val
675 680 685

Ser Ala Pro Arg Phe Ile Ser Pro Ala Ser Gln Leu Ala Ile Thr Phe
690 695 700

Ile Leu Ile Ser Leu Gln Leu Leu Gly Ile Cys Val Trp Phe Val Val
705 710 715 720

Asp Pro Ser His Ser Val Val Asp Phe Gln Asp Gln Arg Thr Leu Asp
725 730 735

Pro Arg Phe Ala Arg Gly Val Leu Lys Cys Asp Ile Ser Asp Leu Ser
740 745 750

Leu Ile Cys Leu Leu Gly Tyr Ser Met Leu Leu Met Val Thr Cys Thr
755 760 765

Val Tyr Ala Ile Lys Thr Arg Gly Val Pro Glu Thr Phe Asn Glu Ala
770 775 780

Lys Pro Ile Gly Phe Thr Met Tyr Thr Thr Cys Ile Val Trp Leu Ala
785 790 795 800

Phe Ile Pro Ile Phe Phe Gly Thr Ser Gln Ser Ala Asp Lys Leu Tyr
805 810 815

Ile Gln Thr Thr Thr Leu Thr Val Ser Val Ser Leu Ser Ala Ser Val
820 825 830

Ser Leu Gly Met Leu Tyr Met Pro Lys Val Tyr Ile Ile Leu Phe His
835 840 845

Pro Glu Gln Asn Val Pro Lys Arg Lys Arg Ser Leu Lys Ala Val Val
850 855 860

Thr Ala Ala Thr Met Ser Asn Lys Phe Thr Gln Lys Gly Asn Phe Arg
865 870 875 880

Pro Asn Gly Glu Ala Lys Ser Glu Leu Cys Glu Asn Leu Glu Thr Pro
885 890 895

Ala Leu Ala Thr Lys Gln Thr Tyr Val Thr Tyr Thr Asn His Ala Ile
900 905 910



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2426 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA



(vii) IMMEDIATE SOURCE: 
(B) CLONE: SR13 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
CCCAACATCA CGTTGGGCGC CCGCATTCTG GACACCTGCT CGAGGGACAC CCACGCCCTG 60
GAGCAGTCAC TGACCTTTGT GCGGGCGCTC ATCGAGAAGG ACGGCACGGA GGTCCGCTGC 120
GGCAGGCGGG GCCCGCCCAT CATCACCAAG CCCGAACGAG TGGTGGGTGT CATTGGAGCT 180
TCGGGGAGCT CCGTCTCGAT CATGGTGGCC AACATCCTCC GCCTCTTCAA GATCCCTCAG 240
ATCACCTATG CCTCCACGGC CCCTGACTTG AGTGACAACA GCCGCTATGA CTTCTTCTCC 300
CGGGTGGTGC CCTCAGACAC ATACCAGGCC CAGGCCATGG TGGATATTGT CCGAGCCCTC 360
AAGTGGAACT ATGTGTCCAC ACTGGCCTCA GAGGGCAGCT ACGGTGAGAG TGGTGTGGAG 420
GCCTTTATCC AGAAGTCCCG AGAGAACGGA GGTGTGTGCA TTGCCCAGTC GGTGAAGATT 480
CCACGGGAAC CCAAGACGGG GGAGTTCGAC AAGATCATCA AACGCCTACT GGAAACATCC 540
AATGCCAGGG GTATCATCAT CTTTGCCAAC GAGGATGACA TCAGGAGGGT GTTGGAGGCA 600
GCTCGCAGGG CCAACCAGAC CGGCCACTTC TTTTGGATGG GTTCTGATAG CTGGGGCTCC 660
AAGAGTGCCC CTGTGCTGCG CCTTGAGGAG GTGGCCGAGG GCGCAGTCAC CATTCTCCCC 720
AAGAGGATGT CTGTTCGAGG GTTCGACCGA TACTTCTCCA GCCGCACGCT GGACAACAAC 780
AGGCGCAACA TCTGGTTTGC CGAGTTCTGG GAGGACAACT TCCATTGCAA GTTGAGCCGC 840
CACGCGCTCA AGAAGGGAAG CCACATCAAG AAGTGCACCA ACCGAGAGCG CATCGGGCAG 900
GACTCGGCCT ATGAGCAGGA GGGGAAGGTG CAGTTCGTGA TTGACGCTGT GTACGCCATG 960
GGCCACGCGC TGCACGCCAT GCACCGTGAC CTGTGTCCCG GCCGCGTAGG ACTCTGCCCT 1020
CGCATGGACC CCGTGGATGG CACCCAGCTG CTTAAGTACA TCAGGAACGT CAACTTCTCA 1080
GGCATTGCGG GGAACCCTGT AACCTTCAAT GAGAACGGAG ACGCACCGGG GCGCTACGAC 1140
ATCTACCAGT ACCAACTGCG CAATGGCTCG GCCGAGTACA AGGTCATCGG CTCGTGGACA 1200
GACCACCTGC ACCTCAGAAT AGAGCGGATG CAGTGGCCAG GGAGTGGCCA GCAGCTGCCG 1260
CGCTCCATCT GCAGTCTGCC CTGCCAGCCC GGGGAGCGAA AGAAGACTGT GAAGGGCATG 1320
GCTTGCTGCT GGCACTGCGA GCCCTGCACC GGGTACCAGT ACCAAGTGGA CCGCTACACC 1380
TGTAAGACCT GCCCCTACGA CATGCGGCCC ACAGAGAACC GCACGAGCTG CCAGCCCATC 1440




CCCATCGTCA AGTTGGAGTG GGACTCGCCG TGGGCCGTGC TGCCCCTCTT CCTGGCCGTG 1500
GTGGGCATCG CCGCCACGCT GTTCGTGGTG GTCACGTTTG TGCGCTACAA CGATACCCCC 1560
ATCGTCAAGG CCTCGGGCCG GGAGCTGAGC TACGTGCTGC TGGCGGGCAT CTTTCTGTGC 1620
TACGCCACTA CCTTCCTCAT GATCGCAGAG CCGGACCTGG GGACCTGTTC GCTCCGCCGC 1680
ATCTTCCTAG GGCTCGGCAT GAGCATCAGC TACGCGGCCC TGCTGACCAA GACCAACCGC 1740
ATTTACCGCA TCTTTGAGCA GGGCAAACGG TCGGTCAGTG CCCCGCGTTT CATCAGCCCG 1800
GCCTCGCAGC TGGCCATCAC CTTCATCCTC ATCTCCCTGC AGCTGCTCGG CATCTGCGTG 1860
TGGTTCGTGG TGGACCCCTC CCACTCGGTG GTGGACTTCC AGGACCAACG GACACTTGAC 1920
CCCCGCTTTG CCAGGGGCGT GCTCAAGTGC GACATCTCGG ACCTGTCCCT CATCTGCCTG 1980
CTGGGCTACA GCATGCTGCT GATGGTCACG TGTACTGTGT ACGCCATCAA GACCCGAGGC 2040
GTGCCCGAGA CCTTCAACGA GGCCAAGCCC ATCGGCTTCA CCATGTACAC CACCTGCATT 2100
GTCTGGCTGG CCTTCATCCC CATCTTTTTT GGCACCTCAC AGTCAGCCGA CAAGCTGTAC 2160
ATCCAGACAA CCACACTGAC GGTCTCCGTG AGTCTGAGCG CTTCAGTGTC CCTGGGGATG 2220
CTCTACATGC CCAAAGTCTA CATCATCCTC TTCCATATTT TTCCATTCTG CTCCTGGCCT 2280
TCCCCTGCCA TCTGCCCTGC CCCCTGCCCC TCCTCCCTGA GCTGCCCCAT CCCCGCCATC 2340
ATTTTCTCTT CTGTTCCCCC TCGATCTCAT TTCCTACCAG CCTTCCCCCT ACTTGGCTTC 2400
CTCCACCAAC TCTTTCACCA CGTTGC 2426

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:

Arg Asp Ser Leu Ile Ser Ile Arg Asp Glu Lys Asp Gly Leu Asn Arg
1 5 10 15

Cys

(2) INFORMATION FOR SEQ ID NO: 22:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 16 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:

Asp Arg Leu Leu Arg Lys Leu Arg Glu Arg Leu Pro Lys Ala Arg Val
1 5 10 15

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Glu Glu Val Trp Phe Asp Glu Lys Gly Asp Ala Pro Gly Arg Tyr Asp



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 15 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24:

Glu Phe Val Tyr Glu Arg Glu Gly Asn TJr Glu Glu Asp Glu Leu

(2) INFORMATION FOR SEQ ID NO: 25:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 17 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:

Pro Glu Arg Lys Cys Cys Glu Ile Arg Glu Gln Tyr Gly Ile Gln Arg

Val



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Ile Gly Pro Gly Ser Ser Ser Val Ala Ile Gln Val Gln Asn Leu Leu

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

Ile Ala Tyr Ser Ala Thr Ser Ile Asp Leu Ser Asp Lys Thr Leu
1 5 10 15

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:

Lys Lys Pro Gly Ala Gly Asn Ala Lys Lys Arg Gln Pro Glu Phe Ser
1 5 10 15

(2) INFORMATION FOR SEQ ID NO: 29:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 16 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29:

Pro Glu Phe Ser Pro Ser Ser Gln Cys Pro Ser Ala His Ala Gln Leu
1 5 10 15

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30:

Asp Lys Ile Ile Lys Arg Leu Leu Glu Thr Ser Asn Ala Arg Gly

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31:

Val Asn Phe Ser Gly Ile Ala Gly Asn Pro Val Thr Phe Asn Glu Asn

1 5 10 15

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Gly Glu Ala Lys Ser Glu Leu Cys Glu Asn Leu Glu Thr Pro Ala Leu
1 5 10 15 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

Pro Ala Arg Leu Ala Leu Pro Ala Asn Asp Thr Glu Phe Ser Ala Trp 
1 5 10 

Val

WHAT IS CLAIMED IS:

1. An isolated mammalian G protein-coupled glutamate
receptor or a fragment thereof.

2. The G protein-coupled glutamate receptor of claim 
1, which is substantially pure. 

3. The G protein-coupled glutamate receptor of claim
1, which is human or rodent.

4. An antiserum obtained from an animal immunized
with the G protein-coupled glutamate receptor of claim 1.

5. A monoclonal antibody which specifically binds to
the G protein-coupled glutamate receptor of claim 1.

6. The G protein-coupled glutamate receptor of claim
1, which binds glutamate or quisqualate and thereby activates
phospholipase C or stimulates inositol phospholipid metabolism
in a vertebrate cell.

7. A recombinantly produced polypeptide having the
activity of a mammalian G protein-coupled glutamate receptor.

8. The polypeptide of claim 7, which has the
activity of a human or rodent mammalian G protein-coupled
glutamate receptor.

9. An isolated and purified polynucleotide molecule
which codes for a mammalian G protein-coupled glutamate
receptor or a fragment thereof.

10. The polynucleotide of claim 9, which is a
genomic DNA sequence, a cDNA sequence, or an RNA antisense
sequence.

11. The polynucleotide of claim 9, which codes for
human or rodent G protein-coupled glutamate receptor.

12. The polynucleotide of claim 9, which encodes a 
polypeptide displaying mammalian G protein-coupled glutamate 
receptor activity. 

13. The polynucleotide of claim 9, which is
substantially the sequence of Fig. 5, Fig. 7, Fig. 8 or Fig. 9.

14. A probe which comprises an oligonucleotide
capable of specifically hybridizing with a gene which encodes a
mammalian G protein-coupled glutamate receptor or a fragment
thereof.

15. The probe of claim 14, which comprises from
about 40 to about 60 nucleotides in length.

16. The probe of claim 15, which is labeled to
provide a detectable signal.

17. A DNA construct comprising the following
operably linked elements:

a transcriptional promoter;
a DNA sequence encoding a mammalian G protein-coupled
glutamate receptor or a fragment thereof; and
a transcriptional terminator.

18. The DNA construct of claim 17, wherein the DNA
sequence encodes a human or rodent G protein-coupled glutamate
receptor polypeptide.

19. The DNA construct of claim 17, wherein the DNA
sequence encoding the mammalian G protein-coupled glutamate
receptor is substantially the sequence of Fig. 5, Fig. 7, Fig.
8 or Fig. 9.

20. A cultured eukaryotic cell transformed or
transfected with a DNA construct which comprises the following
operably linked elements:

a transcriptional promoter;
a DNA sequence encoding a mammalian G protein-coupled
glutamate receptor or a fragment thereof; and
a transcriptional terminator.

21. The eukaryotic cell of claim 20, which is a
mammalian cell.

22. The eukaryotic cell of claim 20, which does not 
express endogenous G protein-coupled glutamate receptors. 

23. The eukaryotic cell line of claim 20, wherein
the DNA sequence encodes a human or rodent G protein-coupled
glutamate receptor polypeptide.

24. The eukaryotic cell line of claim 21, wherein
the G protein-coupled glutamate receptor polypeptide encoded by
the DNA sequence is coupled to G protein in a mammalian cell.

25. The DNA construct of claim 20, wherein the DNA
sequence encoding the mammalian G protein-coupled glutamate
receptor is substantially the sequence of Fig. 5, Fig. 7, Fig.
8 or Fig. 9.

26. A method for producing a mammalian G protein-coupled
glutamate receptor, which comprises:

growing eukaryotic cells transformed or transfected
with a DNA construct which comprises a DNA sequence coding for
the expression of the G protein-coupled glutamate receptor, and
isolating the receptor from the cells.

27. The method of claim 26, wherein the cells are
cultured mammalian cells.

28. The method of claim 26, wherein the glutamate
receptor is human or rodent.

29. The method of claim 26, wherein the glutamate
receptor is isolated by immunoaffinity purification.

30. The method of claim 26, wherein the G protein-coupled
glutamate receptor is not coupled to protein G in the
eukaryotic cells.

31. A method for determining the presence of a 
mammalian G protein-coupled glutamate receptor in a biological 
sample, which comprises incubating the sample with a
monospecific antibody which specifically binds to the receptor 
under conditions sufficient for immune complex formation and 
determining therefrom the presence of the immune complexes. 

32. The method of claim 31, wherein the monospecific 
antibody is a monoclonal antibody or a purified antiserum. 

33. The method of claim 32, wherein the monospecific 
antibody is labeled. 

34. A method for identifying a compound which alters
G protein-coupled glutamate receptor-mediated metabolism, which
comprises incubating the compound with eukaryotic cells which 
express recombinant mammalian G protein-coupled glutamate 
receptor and determining therefrom the effect of said compound 
on receptor-mediated metabolism in the cells. 

35. The method of claim 34, wherein the compound is
incubated with the receptor and ligand.

36. The method of claim 35, wherein the ligand is
glutamate or quisqualate.

37. The method of claim 34, wherein the eukaryotic
cell expresses a human or rodent G protein-coupled glutamate
receptor.

38. The method of claim 37, wherein inositol 
phospholipid metabolism in the eukaryotic cell is monitored for 
alteration by the compound. 
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CCGAGAACGG CTGCAGTCCT CTGACCTGAG ACCAATAGCT GTGTCTACCC GGACTCAGCG 

120

TCCAGCTCAC CGCCACTAAC GCGCCGCGCA TTGGACACCT GATCCACACA CCTTCGGGCA

180

CCAGTGAAAA ACCGCGACTT GATTTTCTGG AAGAACGCCC CCAGGGTGTG GGAGCGGTCG

240

TGGAGGACCA GCAGGAGGAA GCGGAGGGGA GAGGGGCAGT AGTGGAGGCA GAGAAAGCGT

300

TGAACCAGCT GTGTTGGCCG AAGGCACGAA ACGGCAAAAG GCAGCGGTGA GCATCTGTGT

360

GGTTCCCGCT GGGAAGGTGC AGGCAGGACC GGCGTGGGAA GGTGGCTGGC CCGCGGTGGA

409

CCGCGTCTTC GCCACA ATG GTC CGG CTC CTC TTG ATT TTC TTC CCA ATG
                  Met Val Arg Leu Leu Leu Ile Phe Phe Pro Met
                  1 5 10

457

ATC TTT TTG GAG ATG TCC ATT TTG CCC AGG ATG CCT GAC AGA AAA GTA
Ile Phe Leu Glu Met Ser Ile Leu Pro Arg Met Pro Asp Arg Lys Val
15 20 25

505

TTG CTG GCA GGT GCC TCG TCC CAG CGC TCC GTG GCG AGA ATG GAC GGA
Leu Leu Ala Gly Ala Ser Ser Gln Arg Ser Val Ala Arg Met Asp Gly
30 35 40

553

GAT GTC ATG ATC GGA GCC CTC TTC TCA GTC CAT CAC CAG CCT CCA GCC
Asp Val Ile Ile Gly Ala Leu Phe Ser Val His His Gln Pro Pro Ala
45 50 55

GAG AAG GTA CCC GAA AGG AAG TGT GGG GAG ATC AGG GAA CAG TAT GGT
Glu Lys Val Pro Glu Arg Lys Cys Gly Glu Ile Arg Glu Gln Tyr Gly
60 65 70 75

649

ATC CAG AGG GTG GAG GCC ATG TTC CAC ACG TTG GAT AAG ATT AAC GCG
Ile Gln Arg Val Glu Ala Met Phe His Thr Leu Asp Lys Ile Asn Ala
80 85 90

697

GAC CCG GTG CTC CTG CCC AAC ATC ACT CTG GGC AGT GAG ATC CGG GAC
Asp Pro Val Leu Leu Pro Asn Ile Thr Leu Gly Ser Glu Ile Arg Asp
95 100 105

745

TCC TGC TGG CAC TCT TCA GTG GCT CTC GAA CAG AGC ATC GAA TTC ATC
Ser Cys Trp His Ser Ser Val Ala Leu Glu Gln Ser Ile Glu Phe Ile
110 115 120



FIG. 5A

793

AGA GAC TCC CTG ATT TCC ATC CGA GAT GAG AAG GAT GGG CTG AAC CGA

Arg Asp Ser Leu Ile Ser Ile Arg Asp Glu Lys Asp Gly Leu Asn Arg
125 130 135 

841 

TGC CTG CCT GAT GGC CAG ACC CTG CCC CCT GGC AGG ACT AAG AAG CCT 

Cys Leu Pro Asp Gly Gln Thr Leu Pro Pro Gly Arg Thr Lys Lys Pro

140 145 150 155

889 

ATT GCT GGA GTG ATC GGC CCT GGC TCC AGC TCT GTG GCC ATT CAA GTC 
Ile Ala Gly Val Ile Gly Pro Gly Ser Ser Ser Val Ala Ile Gln Val

160 165 170 

CAG AAT CTT CTC CAG CTG TTC GAC ATC CCA CAG ATC GCC TAT TCT GCC 
Gln Asn Leu Leu Gln Leu Phe Asp Ile Pro Gln Ile Ala Tyr Ser Ala
175 180 185 

985 

ACA AGC ATA GAC CTG AGT GAC AAA ACT TTG TAC AAA TAC TTC CTG AGG 
Thr Ser Ile Asp Leu Ser Asp Lys Thr Leu Tyr Lys Tyr Phe Leu Arg
190 195 200 

1033 

GTG GTC CCT TCT GAC ACT TTG CAG GCA AGG GCG ATG CTC GAC ATA GTC 
Val Val Pro Ser Asp Thr Leu Gln Ala Arg Ala Met Leu Asp Ile Val
205 210 215

AAG CGT TAC AAC TGG ACC TAT GTC TCA GCA GTC CAC ACA GAA GGG AAT
Lys Arg Tyr Asn Trp Thr Tyr Val Ser Ala Val His Thr Glu Gly Asn 
220 225 230 235 

TAC GGC GAG AGT GGA ATG GAT GCT TTC AAA GAA CTG GCT GCC CAG GAA
Tyr Gly Glu Ser Gly Met Asp Ala Phe Lys Glu Leu Ala Ala Gln Glu

240 245 250 

1177 

GGC CTC TGC ATC GCA CAC TCG GAC AAA ATC TAC AGC AAT GCT GGC GAG 
Gly Leu Cys Ile Ala His Ser Asp Lys Ile Tyr Ser Asn Ala Gly Glu
255 260 265

1225 

AAG AGC TTT GAC CGG CTC CTG CGT AAA CTC CGG GAG CGG CTT CCC AAG 
Lys Ser Phe Asp Arg Leu Leu Arg Lys Leu Arg Glu Arg Leu Pro Lys 
270 275 280 

1273

GCC AGG GTT GTG GTC TGC TTC TGC GAG GGC ATG ACA GTG CGG GGC TTA 
Ala Arg Val Val Val Cys Phe Cys Glu Gly Met Thr Val Arg Gly Leu
285 290 295 

FIG. 5B




1321 

CTG AGT GCC ATG CGC CGG CTG GGC GTC GTG GGC GAG TTC TCA CTC ATT 
Leu Ser Ala Met Arg Arg Leu Gly Val Val Gly Glu Phe Ser Leu Ile
300 305 310 315 

1369 

GGA AGT GAT GGA TGG GCA GAC AGA GAT GAA GTC ATC GAA GGC TAT GAG 
Gly Ser Asp Gly Trp Ala Asp Arg Asp Glu Val Ile Glu Gly Tyr Glu



320 325 330 

1417 

GTG GAA GCC AAC GGA GGG ATC ACA ATA AAG CTT CAG TCT CCA GAG GTC
Val Glu Ala Asn Gly Gly Ile Thr Ile Lys Leu Gln Ser Pro Glu Val
335 340 345 

1465 

AGG TCA TTT GAT GAC TAC TTC CTG AAG CTG AGG CTG GAC ACC AAC ACA 
Arg Ser Phe Asp Asp Tyr Phe Leu Lys Leu Arg Leu Asp Thr Asn Thr 
350 355 360 

1513

AGG AAT CCT TGG TTC CCT GAG TTC TGG CAA CAT CGC TTC CAG TGT CGC 

Arg Asn Pro Trp Phe Pro Glu Phe Trp Gln His Arg Phe Gln Cys Arg

365 370 375

1561 

CTA CCT GGA CAC CTC TTG GAA AAC CCC AAC TTT AAG AAA GTG TGC ACA 

Leu Pro Gly His Leu Leu Glu Asn Pro Asn Phe Lys Lys Val Cys Thr

380 385 390 395

1609
GGA AAT GAA AGC TTG GAA GAA AAC TAT GTC CAG GAC AGC AAA ATG GGA 
Gly Asn Glu Ser Leu Glu Glu Asn Tyr Val Gln Asp Ser Lys Met Gly

400 405 410 

1657 

TTT GTC ATC AAT GCC ATC TAT GCC ATG GCA CAT GGG CTG CAG AAC ATG 
Phe Val Ile Asn Ala Ile Tyr Ala Met Ala His Gly Leu Gln Asn Met
415 420 425 

1705 

CAC CAT GCT CTG TGT CCC GGC CAT GTG GGC CTG TGT GAT GCT ATG AAA 
His His Ala Leu Cys Pro Gly His Val Gly Leu Cys Asp Ala Met Lys 
430 435 440

1753

CCC ATT GAT GGC AGG AAG CTC CTG GAT TTC CTC ATC AAA TCC TCT TTT 

Pro Ile Asp Gly Arg Lys Leu Leu Asp Phe Leu Ile Lys Ser Ser Phe

445 450 455 

1801 

GTC GGA GTG TCT GGA GAG GAG GTG TGG TTC GAT GAG AAG GGG GAT GCT 

Val Gly Val Ser Gly Glu Glu Val Trp Phe Asp Glu Lys Gly Asp Ala 

460 465 470 475 



FIG. 5C



1849

CCC GGA AGG TAT GAC ATT ATG AAT CTG CAG TAC ACA GAA GCT AAT CGC 

Pro Gly Arg Tyr Asp Ile Met Asn Leu Gln Tyr Thr Glu Ala Asn Arg

480 485 490 

1897 

TAT GAC TAT GTC CAC GTG GGG ACC TGG CAT GAA GGA GTG CTG AAT ATT

Tyr Asp Tyr Val His Val Gly Thr Trp His Glu Gly Val Leu Asn Ile

495 500 505 

1945
GAT GAT TAC AAA ATC CAG ATG AAC AAA AGC GGA ATG GTA CGA TCT GTG 
Asp Asp Tyr Lys Ile Gln Met Asn Lys Ser Gly Met Val Arg Ser Val
510 515 520

1993 

TGC AGT GAG CCT TGC TTA AAG GGT CAG ATT AAG GTC ATA CGG AAA GGA 
Cys Ser Glu Pro Cys Leu Lys Gly Gln Ile Lys Val Ile Arg Lys Gly
525 530 535 

2041 

GAA GTG AGC TGC TGC TGG ATC TGC ACG GCC TGC AAA GAG AAT GAG TTT 
Glu Val Ser Cys Cys Trp Ile Cys Thr Ala Cys Lys Glu Asn Glu Phe
540 545 550 555

2089 

GTG CAG GAC GAG TTC ACC TGC AGA GCC TGT GAC CTG GGG TGG TGG CCC 
Val Gln Asp Glu Phe Thr Cys Arg Ala Cys Asp Leu Gly Trp Trp Pro

560 565 570 

2137 

AAC GCA GAG CTC ACA GGC TGT GAG CCC ATT CCT GTC CGT TAT CTT GAG 
Asn Ala Glu Leu Thr Gly Cys Glu Pro Ile Pro Val Arg Tyr Leu Glu
575 580 585



2185 

TGG AGT GAC ATA GAA TCT ATC ATA GCC ATC GCC TTT TCT TGC CTG GGC 

Trp Ser Asp Ile Glu Ser Ile Ile Ala Ile Ala Phe Ser Cys Leu Gly

590 595 600

2233

ATC CTC GTG ACG CTG TTT GTC ACC CTC ATC TTC GTT CTG TAC CGG GAC 

Ile Leu Val Thr Leu Phe Val Thr Leu Ile Phe Val Leu Tyr Arg Asp

605 610 615 



2281

ACA CCC GTG GTC AAA TCC TCC AGT AGG GAG CTC TGC TAT ATC ATT CTG 

Thr Pro Val Val Lys Ser Ser Ser Arg Glu Leu Cys Tyr Ile Ile Leu

620 625 630 635

2329

GCT GGT ATT TTC CTC GGC TAT GTG TGC CCT TTC ACC CTC ATC GCC AAA 

Ala Gly Ile Phe Leu Gly Tyr Val Cys Pro Phe Thr Leu Ile Ala Lys

640 645 650 

FIG. 5D



60

CCCGGGCTCC CGGCAGTGCG AGCAGCTAAG GGCTGGCCGC CGCCTCCCTG AGCTCCCCCG 

120

GMGCAGCCGA CCCCTGGTCG CGGCGTTCAC CTCGCCGATG CGCGGTTGGT AGGAGTGACC

180

GGAGCCATTC TCTCCTCGTT GATAAGATTC CCTACCAGGA TAGGAGCCf A TCTCCCTTTY

240

CACAGCAGGA CACAGAAATC TGGCCTTCAG TACTTTGGGA AAAGGATCTG AGACCTCCTG

300

GAGCTCTGAC CACTGGCTGT CATCTGTGGC TCTGGCCTGT GTGGGCCACT GAGCTCTACT

360

CAAACATTAA AGAGGAGGAG GGGAGATCTG TGGAATGGGC CACCCCGTTG GCCTGCTGCA

420

TTACTGAACC TGCGCTGTCC ACACGTGCCC AGATCATGGG ACCCAGGGCC TGCTAGGGCT

475

AGGAGCGGGG CCCAGTATTC ATGGGTCTCT AGGCCTTTCC GAA ATG TCC GGG AAG 

                                                Met Ser Gly Lys

523

GGA GGC TGG GCC TGG TGG TGG GCG CGG CTG CCC CTC TGC CTA CTC CTC
Gly Gly Trp Ala Trp Trp Trp Ala Arg Leu Pro Leu Cys Leu Leu Leu
5 10 15 20

571

AGG CTT TAT GCC CCC TGG GTG CCT TCA TCC TTG GGA AAG CCC AAG GGT
Ser Leu Tyr Ala Pro Trp Val Pro Ser Ser Leu Gly Lys Pro Lys Gly
25 30 35

619

CAC CCC CAC ATG AAC TCT ATC CGA ATT GAC GGG GAC ATC ACA CTG GGA
His Pro His Met Asn Ser Ile Arg Ile Asp Gly Asp Ile Thr Leu Gly
40 45 50

667

GGC CTG TTT CCC GTC CAC GGC CGT GGC TCT GAG GGT AAG GCC TGC GGG
Gly Leu Phe Pro Val His Gly Arg Gly Ser Glu Gly Lys Ala Cys Gly
55 60 65

715

GAG CTG AAG AAG GAG AAA GGC ATC CAC CGC CTG GAG GCC ATG CTG TTT
Glu Leu Lys Lys Glu Lys Gly Ile His Arg Leu Glu Ala Met Leu Phe
70 75 80

763

GCC CTG GAG CGC ATC AAC AAT GAC CCG GAC CTA CTG CCC AAC ATC ACG
Ala Leu Asp Arg Ile Asn Asn Asp Pro Asp Leu Leu Pro Asn Ile Thr
85 90 95 100

FIG. 8A




811 

TTG GGC GCC CGC ATT CTG GAC ACC TGC TCG AGG GAC ACC CAC GCC CTG
Leu Gly Ala Arg Ile Leu Asp Thr Cys Ser Arg Asp Thr His Ala Leu

105 110 115

859 

GAG CAG TCA CTG ACC TTT GTG CGG GCG CTC ATC GAG AAG GAC GGC ACG 
Glu Gln Ser Leu Thr Phe Val Arg Ala Leu Ile Glu Lys Asp Gly Thr
120 125 130

907

GAG GTC CGC TGC GGC AGG CGG GGC CCG CCC ATC ATC ACC AAG CCC GAA 
Glu Val Arg Cys Gly Arg Arg Gly Pro Pro Ile Ile Thr Lys Pro Glu
135 140 145

955

CGA GTG GTG GGT GTC ATT GGA GCT TCG GGG AGC TCC GTC TCG ATC ATG 

Arg Val Val Gly Val Ile Gly Ala Ser Gly Ser Ser Val Ser Ile Met
150 155 160

1003

GTG GCC AAC ATC CTC CGC CTC TTC AAG ATC CCT CAG ATC AGC TAT GCC 

Val Ala Asn Ile Leu Arg Leu Phe Lys Ile Pro Gln Ile Ser Tyr Ala

165 170 175 180 

1051 

TCC ACG GCC CCT GAC TTG AGT GAC AAC AGC CGC TAT GAC TTC TTC TCC
Ser Thr Ala Pro Asp Leu Ser Asp Asn Ser Arg Tyr Asp Phe Phe Ser 

185 190 195 

1099 

CGG GTG GTG CCC TCA GAC ACA TAC CAG GCC CAG GCC ATG GTG GAT ATT
Arg Val Val Pro Ser Asp Thr Tyr Gln Ala Gln Ala Met Val Asp Ile
200 205 210 

1147 

GTC CGA GCC CTC AAG TGG AAC TAT GTG TCC ACA CTG GCC TCA GAG GGC 
Val Arg Ala Leu Lys Trp Asn Tyr Val Ser Thr Leu Ala Ser Glu Gly 
215 220 225

1195 

AGC TAC GGT GAG AGT GGT GTG GAG GCC TTT ATC CAG AAG TCC CGA GAG 
Ser Tyr Gly Glu Ser Gly Val Glu Ala Phe Ile Gln Lys Ser Arg Glu

230 235 240

1243

AAC GGA GGT GTG TGC ATT GCC CAG TCG GTG AAG ATT CCA CGG GAA CCC 
Asn Gly Gly Val Cys Ile Ala Gln Ser Val Lys Ile Pro Arg Glu Pro
245 250 255 260

1291

AAG ACG GGG GAG TTC GAC AAG ATC ATC AAA CGC CTA CTG GAA ACA TCC 
Lys Thr Gly Glu Phe Asp Lys Ile Ile Lys Arg Leu Leu Glu Thr Ser

265 270 275

FIG. 8B



1339

AAT GCC AGG GGT ATC ATC ATC TTT GCC AAC GAG GAT GAC ATC AGG AGG 
Asn Ala Arg Gly Ile Ile Ile Phe Ala Asn Glu Asp Asp Ile Arg Arg
280 285 290 

1387 

GTG TTG GAG GCA GCT CGC AGG GCC AAC CAG ACC GGC CAC TTC TTT TGG 
Val Leu Glu Ala Ala Arg Arg Ala Asn Gln Thr Gly His Phe Phe Trp
295 300 305 

1435

ATG GGT TCT GAT AGC TGG GGC TCC AAG AGT GCC CCT GTG CTG CGC CTT 
Met Gly Ser Asp Ser Trp Gly Ser Lys Ser Ala Pro Val Leu Arg Leu 
310 315 320 

1483 

GAG GAG GTG GCC GAG GGC GCA GTC ACC ATT CTC CCC AAG AGG ATG TCT 
Glu Glu Val Ala Glu Gly Ala Val Thr Ile Leu Pro Lys Arg Met Ser
325 330 335 340 

1531
GTT CGA GGG TTC GAC CGA TAC TTC TCC AGC CGC ACG CTG GAC AAC AAC
Val Arg Gly Phe Asp Arg Tyr Phe Ser Ser Arg Thr Leu Asp Asn Asn 

345 350 355 

1579

AGG CGC AAC ATC TGG TTT GCC GAG TTC TGG GAG GAC AAC TTC CAT TGC 
Arg Arg Asn Ile Trp Phe Ala Glu Phe Trp Glu Asp Asn Phe His Cys
360 365 370 

1627 

AAG TTG AGC CGC CAC GCG CTC AAG AAG GGA AGC CAC ATC AAG AAG TGC 
Lys Leu Ser Arg His Ala Leu Lys Lys Gly Ser His Ile Lys Lys Cys
375 380 385 

1675

ACC AAC CGA GAG CGC ATC GGG CAG GAC TCG GCC TAT GAG CAG GAG GGG 
Thr Asn Arg Glu Arg Ile Gly Gln Asp Ser Ala Tyr Glu Gln Glu Gly
390 395 400

1723

AAG GTG CAG TTC GTG ATT GAC GCT GTG TAC GCC ATG GGC CAC GCG CTG 
Lys Val Gln Phe Val Ile Asp Ala Val Tyr Ala Met Gly His Ala Leu
405 410 415 420 

1771 

CAC GCC ATG CAC CGT GAC CTG TGT CCC GGC CGC GTA GGA CTC TGC CCT 
His Ala Met His Arg Asp Leu Cys Pro Gly Arg Val Gly Leu Cys Pro

425 430 435 

1819 

CGC ATG GAC CCC GTG GAT GGC ACC CAG CTG CTT AAG TAC ATC AGG AAC
Arg Met Asp Pro Val Asp Gly Thr Gln Leu Leu Lys Tyr Ile Arg Asn
440 445 450

FIG. 8C



1867
GTC AAC TTC TCA GGC ATT GCG GGG AAC CCT GTA ACC TTC AAT GAG AAC 
Val Asn Phe Ser Gly Ile Ala Gly Asn Pro Val Thr Phe Asn Glu Asn
455 460 465

1915 

GGA GAC GCA CCG GGG CGC TAC GAC ATC TAC CAG TAC CAA CTG CGC AAT 
Gly Asp Ala Pro Gly Arg Tyr Asp Ile Tyr Gln Tyr Gln Leu Arg Asn
470 475 480 

1963 

GGC TCG GCC GAG TAC AAG GTC ATC GGC TCG TGG ACA GAC CAC CTG CAC 
Gly Ser Ala Glu Tyr Lys Val Ile Gly Ser Trp Thr Asp His Leu His
485 490 495 500 

2011 

CTC AGA ATA GAG CGG ATG CAG TGG CCA GGG AGT GGC CAG CAG CTG CCG 
Leu Arg Ile Glu Arg Met Gln Trp Pro Gly Ser Gly Gln Gln Leu Pro

505 510 515 

2059 

CGC TCC ATC TGC AGT CTG CCC TGC CAG CCC GGG GAG CGA AAG AAG ACT 
Arg Ser Ile Cys Ser Leu Pro Cys Gln Pro Gly Glu Arg Lys Lys Thr
520 525 530

2107 

GTG AAG GGC ATG GCT TGC TGC TGG CAC TGC GAG CCC TGC ACC GGG TAC 
Val Lys Gly Met Ala Cys Cys Trp His Cys Glu Pro Cys Thr Gly Tyr 
535 540 545 

2155 

CAG TAC CAA GTG GAC CGC TAC ACC TGT AAG ACC TGC CCC TAC GAC ATG 
Gln Tyr Gln Val Asp Arg Tyr Thr Cys Lys Thr Cys Pro Tyr Asp Met
550 555 560 

2203 

CGG CCC ACA GAG AAC CGC ACG AGC TGC CAG CCC ATC CCC ATC GTC AAG
Arg Pro Thr Glu Asn Arg Thr Ser Cys Gln Pro Ile Pro Ile Val Lys
565 570 575 580

2251 

TTG GAG TGG GAC TCG CCG TGG GCC GTG CTG CCC CTC TTC CTG GCC GTG 
Leu Glu Trp Asp Ser Pro Trp Ala Val Leu Pro Leu Phe Leu Ala Val 

585 590 595

2299 

GTG GGC ATC GCC GCC ACG CTG TTC GTG GTG GTC ACG TTT GTG CGC TAC 
Val Gly Ile Ala Ala Thr Leu Phe Val Val Val Thr Phe Val Arg Tyr
600 605 610

2347 

AAC GAT ACC CCC ATC GTC AAG GCC TCG GGC CGG GAG CTG AGC TAC GTG 
Asn Asp Thr Pro Ile Val Lys Ala Ser Gly Arg Glu Leu Ser Tyr Val
615 620 625 

FIG. 8D




2395 

CTG CTG GCG GGC ATC TTT CTG TGC TAC GCC ACT ACC TTC CTC ATG ATC 
Leu Leu Ala Gly Ile Phe Leu Cys Tyr Ala Thr Thr Phe Leu Met Ile
630 635 640 

2443 

GCA GAG CCG GAC CTG GGG ACC TGT TCG CTC CGC CGC ATC TTC CTA GGG
Ala Glu Pro Asp Leu Gly Thr Cys Ser Leu Arg Arg Ile Phe Leu Gly
645 650 655 660 

2491 

CTC GGC ATG AGC ATC AGC TAC GCG GCC CTG CTG ACC AAG ACC AAC CGC 
Leu Gly Met Ser Ile Ser Tyr Ala Ala Leu Leu Thr Lys Thr Asn Arg

665 670 675 

2539 

ATT TAC CGC ATC TTT GAG CAG GGC AAA CGG TCG GTC AGT GCC CCG CGT 
Ile Tyr Arg Ile Phe Glu Gln Gly Lys Arg Ser Val Ser Ala Pro Arg
680 685 690

2587

TTC ATC AGC CCG GCC TCG CAG CTG GCC ATC ACC TTC ATC CTC ATC TCC 

Phe Ile Ser Pro Ala Ser Gln Leu Ala Ile Thr Phe Ile Leu Ile Ser
695 700 705 

2635 

CTG CAG CTG CTC GGC ATC TGC GTG TGG TTC GTG GTG GAC CCC TCC CAC 

Leu Gln Leu Leu Gly Ile Cys Val Trp Phe Val Val Asp Pro Ser His
710 715 720

2683

TCG GTG GTG GAC TTC CAG GAC CAA CGG ACA CTT GAC CCC CGC TTT GCC

Ser Val Val Asp Phe Gln Asp Gln Arg Thr Leu Asp Pro Arg Phe Ala

725 730 735 740

2731 

AGG GGC GTG CTC AAG TGC GAC ATC TCG GAC CTG TCC CTC ATC TGC CTG 
Arg Gly Val Leu Lys Cys Asp Ile Ser Asp Leu Ser Leu Ile Cys Leu

745 750 755

2779 

CTG GGC TAC AGC ATG CTG CTG ATG GTC ACG TGT ACT GTG TAC GCC ATC 
Leu Gly Tyr Ser Met Leu Leu Met Val Thr Cys Thr Val Tyr Ala Ile
760 765 770

2827

AAG ACC CGA GGC GTG CCC GAG ACC TTC AAC GAG GCC AAG CCC ATC GGC 
Lys Thr Arg Gly Val Pro Glu Thr Phe Asn Glu Ala Lys Pro Ile Gly
775 780 785 

2875 

TTC ACC ATG TAC ACC ACC TGC ATT GTC TGG CTG GCC TTC ATC CCC ATC 
Phe Thr Met Tyr Thr Thr Cys Ile Val Trp Leu Ala Phe Ile Pro Ile
790 795 800 

FIG. 8E




2923 

TTT TTT GGC ACC TCA CAG TCA GCC GAC AAG CTG TAC ATC CAG ACA ACC 
Phe Phe Gly Thr Ser Gln Ser Ala Asp Lys Leu Tyr Ile Gln Thr Thr
805 810 815 820 

2971 

ACA CTG ACG GTC TCC GTG AGT CTG AGC GCT TCA GTG TCC CTG GGG ATG 
Thr Leu Thr Val Ser Val Ser Leu Ser Ala Ser Val Ser Leu Gly Met 

825 830 835 

3019 

CTC TAC ATG CCC AAA GTC TAC ATC ATC CTC TTC CAC CCG GAG CAG AAC 
Leu Tyr Met Pro Lys Val Tyr Ile Ile Leu Phe His Pro Glu Gln Asn
840 845 850 

3067 

GTG CCC AAG CGC AAG CGC AGT CTC AAA GCC GTG GTC ACC GCC GCC ACC 
Val Pro Lys Arg Lys Arg Ser Leu Lys Ala Val Val Thr Ala Ala Thr 
855 860 865

3115 

ATG TCC AAC AAG TTC ACA CAG AAG GGC AAC TTC AGG CCC AAT GGG GAA 
Met Ser Asn Lys Phe Thr Gln Lys Gly Asn Phe Arg Pro Asn Gly Glu
870 875 880 

3163 

GCC AAA TCA GAG CTG TGT GAG AAC CTG GAG ACC CCA GCG CTG GCT ACC 
Ala Lys Ser Glu Leu Cys Glu Asn Leu Glu Thr Pro Ala Leu Ala Thr 
885 890 895 900 

3209

AAA CAG ACC TAC GTC ACC TAC ACC AAC CAT GCC ATC TAGCCGGGCC 
Lys Gln Thr Tyr Val Thr Tyr Thr Asn His Ala Ile

905 910 

3269 

GCGGAGCCAA GCAGGCTAAG GAGCCACAAC CTCTGAGGAT GGCACATTGG GCCAGGGCCG 

3329 

TTCCCGAGGG CCCTGCCGAT GTCTGCCCGC CTCCCGGGCA TCCACGAATG TGGCTTGGTG

3389
CTGAGGACAG TAGAGACCCC GGCCATCACT GCTGGGCAAG CCGTGGTGGG CAACCAGAGG

3449 

AGGCCGAGTG GCTGGGGCAG TTCCAGGTTA TGCCACACAC AGGTCTTCCT TCTGGACCAC 

3509 

TGTTGGCCCA .GCCCCAAAGC ACAGGGGCTC GGTCTCCAGA GCCCAGCCCT GGCTTCCTCT 

3569 

. CCTTCCTCCT GCCTCCGTCT GTCCTGTGGG TGACCCCGGT TGGTCCCTGC CCCGTCTTTA 

3629 

CGTTTCTCTT CCGTCTTTGC TCTGCATGTG TTGTCTGTTT GGGCCCTCTG CTTCCATATT 

FIG. 8E 

3689 

TTTCCATTCT GCTCCTGGCC TTCCCCTGCC ATCTGCCCTG CCCCCTGCCC CTCCTCCCTG 

3749 

AGCTGCCCCA TCCCCGCCAT CATTTTCTCT TCTGTTCCCC CTCGATCTCA TTTCCTACCA 

3809 

GCCTTCCCCC TACTTGGCTT CATCCACCAA CTCTTTCACC ACGTTGCAAA AGAGAAAAAA 

3869 

AAAGGGGGGG GGGAATCACC CCCTACAAAA AAGCCCAAAC AAAAACTAAT CTTGAGTGTG 

3929 

TTTCGAAGTG CTGCGTCCTC CTGGTGGCCT GTGTGTCCCT GTGCCTGCAG CCTGTCTGCC 

3989 

CGCGCTACCC GTCTGCCGTG TGTCCTGCCC CCCCCGCCTG CCCGCCJTGC CCTTCCTGCT 

4049 

AACGACACGG AGTTCAGTGC CTGGGTGTTT GGTGATGGTC TCTGATGTGT AGCATGTCTG 

4096 

TTTTTATACC GAGAACATTT CTAATAAAGA TAAACACATG GTTTTGC 

FIG. 8F 
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CCCAACATCA CGTTGGGCGC CCGCATTCTG GACACCTGCT CGAGGGACAC CCACGCCCTG 

120 

GAGCAGTCAC TGACCTTTGT GCGGGCGCTC ATCGAGAAGG ACGGCACGGA GGTCCGCTGC 

180 

GGCAGGCGGG GCCCGCCCAT CATCACCAAG CCCGAACGAG TGGTGGGTGT CATTGGAGCT 

240 
TCGGGGAGCT CCGTCTCGAT CATGGTGGCC AACATCCTCC GCCTCTTCAA GATCCCTCAG 

300 

ATCAGGTATG CCTCCACGGC CCCTGACTTG AGTGACAACA GCCGCTATGA CTTCTTCTCC 

360 

CGGGTGGTGC CCTCAGACAC ATACCAGGCC CAGGCGATGG TGGATATTGT CCGAGCCCTC 

420 

AAGTGGAACT ATGTGTCCAC ACTGGCCTCA GAGGGCAGCT ACGGTGAGAG TGGTGTGGAG 

480 

GCCTTTATCC AGAAGTCCCG AGAGAACGGA GGTGTGTGCA TTGCCCAGTC GGTGAAGATT 

540 

CCACGGGAAC CCAAGACGGG GGAGTTCGAC AAGATCATCA AACGCCTACT GGAAACATCC 

600 

AATGCCAGGG GTATCATCAT CTTTGCCAAC GAGGATGACA TCAGGAGGGT GTTGGAGGCA 

660 

GCTCGCAGGG CCAACCAGAC CGGCCACTTC TTTTGGATGG GTTCTGATAG CTGGGGCTCC 

720 

AAGAGTGCCC CTGTGCTGCG CCTTGAQGAG GTGGCCGAGG GCGCAGTCAC CATTCTCCCC 

780 

AAGAGGATGT CTGTTCGAGG GTTCGACCGA TACTTCTCCA GCCGCACGCT GGACAACAAC 

840 

AGGCGCAACA TCTGGTTTGC CGAGTTCTGG GAGGACAACT TCCATTGCAA GTTGAGCCGC 

900 

CACGCGCTCA AGAAGGGAAG CCACATCAAG AAGTGCACCA ACCGAGAGCG CATCGGGCAG 

960 

GACTCGGCCT ATGAGCAGGA GGGGAAGGTG CAGTTCGTGA TTGACGCTGT GTACGCCATG 

1020 

GGCCACGCGC TGCACGCCAT GCACCGTGAC GTGTGTCCCG GCCGCGTAGG ACTCTGCCCT 

1080 

CGCATGGACC GCGTGGATGG CACCCAGCTG CTTAAGTACA TCAGGAACGT CAACTTCTCA 

1140 

GGCATTGCGG GGAACCC-TGT AAGG-TTGAAT GAGAACGGAG ACGGACCGGG GCGGTACGAC 

FIG. 9A 

1200 

ATCTACCAGT ACCAACTGCG CAATGGCTCG GCCGAGTACA AGGTCATCGG CTCGTGGACA 

1260 

GACCACCTGC ACCTCAGAAT AGAGCGGATG CAGTGGCCAG GGAGTGGCCA GCAGCTGCCG 

1320 

CGCTCCATCT GCAGTCTGCC CTGCCAGCCC GGGGAGCGAA AGAAGACTGT GAAGGGCATG 

1380 

GCTTGCTGCT GGCACTGCGA GCCCTGCACC GGGTACCAGT ACCAAGTGGA CCGCTACACC 

1440 

TGTAAGACCT GCCCCTACGA CATGCGGCCC ACAGAGAACC GCACGAGCTG CCAGCCCATC 

1500 

CCCATCGTCA AGTTGGAGTG GGACTCGCCG TGGGCCGTGC TGCCCCTCTT GCTGGCCGTG 

1560 

GTGGGCATCG CCGCCACGCT GTTCGTGGTG GTCACGTTTG TGCGCTACAA CGATACCGCC 

1620 

ATCGTGAAGG CCTCGGGCCG GGAGCTGAGC TACGTGCTGC TGGCGGGCAT CTTTCTGTGC 

1680 

TACGCCACTA CCTTCCTCAT GATCGCAGAG CCGGAGCTGG GGACCTGTTC GCTCCGCCGC 

1740 

ATCTTCCTAG GGCTCGGCAT GAGCATCAGC TACGCGGCCC TGCTGACCAA GACCAACCGC 

1800 

ATTTACCGCA TCTTTGAGCA GGGCAAACGG TCGGTCAGTG CCCCGCGTTT CATCAGCCCG 

1860 

GCCTCGCAGC TGGCCATCAC CTTCATCCTC ATCTCCCTGC AGCTGCTCGG CATCTGCGTG 

1920 

TGGTTCGTGG TGGACCCCTC CCACTCGGTG GTGGACTTCC AGGACCAACG GACACTTGAC 

1980 

CCCCGCTTTG CCAGGGGCGT GCTCAAGTGC GACATCTCGG ACCTGTCCCT CATCTGCCTG 

2040 

CTGGGCTACA GCATGCTGCT GATGGTCACG TGTACTGTGT ACGCCATCAA GACCCGAGGC 

2100 

GTGCCCGAGA CCTTCAACGA GGCCAAGCCC ATCGGCTTCA CCATGTACAC CACCTGCATT 

2160 

GTCTGGCTGG CCTTCATCCC CATCTTTTTT GGCACCTCAC AGTCAGCCGA CAAGCTGTAC 

2220 

ATCCAGACAA CCAGACTGAC GGTCTCCGTG AGTCTGAGCG CTTCAGTGTC CCTGGGGATG 

2280 

CTCTACATGC CCAAAGTCTA CATCATCCTC TTCCATATTT TTCCATTCTG CTCCTGGCCT 

FIG. 9B 

2340 

TCCCCTGCCA TCTGCCCTGC CCCCTGCCCC TCCTCCCTGA GCTGCCCCAT CCCCGCCATC 

2400 

ATTTTCTCTT GTGTTCCCCC TCGATCTCAT TTCCTACCAG CCTTCCCCCT ACTTGGCTTC 

2426 

CTCCACCAAC TCTTTCACCA CGTTGC 
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the Amersham random-priming kit (Amersham, Arlington Hts, 
IL). Duplicate lifts were prepared from the plates, and 
the filters were hybridized with the probes in 50% 
formamide at 37°C. After an overnight hybridization, the 
filters were washed in 2x SSC + 0.1% SDS at 50°C. 
Positive plaques were isolated by several rounds of 
dilution plating and repeated screening with the 
random-primed probes. 

Table 3 

NZY Agar 

To 950 ml of deionized water, add: 

10 g NZ amine: casein hydrolysate, enzymatic (ICN Biochemicals) 
5 g NaCl 
5 g bacto-yeast extract 
1 g casamino acids 
2 g MgSO4·7H2O 

Shake until the solutes have dissolved. Adjust to 
pH 7.0 with 5 N NaOH (approximately 0.2 ml). Adjust the 
volume of the solution to 1 liter with deionized H2O. 
Sterilize by autoclaving for 20 minutes. 

20X SSC 

Dissolve 175.3 g NaCl and 88.2 g sodium citrate in 
800 ml H2O. Adjust the pH to 7.0 with a few drops of 10 N 
NaOH. Adjust the volume to 1 liter with H2O. Sterilize 
by autoclaving. 
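
As a rough check on the 20X SSC recipe, the masses in Table 3 can be converted to molar concentrations. The sketch below is illustrative only: the formula weights are assumptions (in particular, it assumes the "sodium citrate" is trisodium citrate dihydrate, which the recipe does not state), and the function names are hypothetical.

```python
# Formula weights in g/mol. The citrate value assumes trisodium citrate
# dihydrate; the recipe itself only says "sodium citrate".
FORMULA_WEIGHT = {"NaCl": 58.44, "sodium citrate": 294.10}

# Grams per liter of 20X SSC stock, as given in Table 3.
STOCK_20X_GRAMS = {"NaCl": 175.3, "sodium citrate": 88.2}


def molarity(grams_per_liter, formula_weight):
    """Convert a mass concentration (g/l) to mol/l."""
    return grams_per_liter / formula_weight


def ssc_molarities(working_fold=20.0):
    """Molar concentration of each solute at a given strength (20 = stock)."""
    return {
        name: molarity(grams, FORMULA_WEIGHT[name]) * working_fold / 20.0
        for name, grams in STOCK_20X_GRAMS.items()
    }


if __name__ == "__main__":
    for fold in (20.0, 2.0):
        rounded = {k: round(v, 3) for k, v in ssc_molarities(fold).items()}
        print(f"{fold:g}X SSC:", rounded)
```

Under these assumptions the stock works out to about 3 M NaCl and 0.3 M citrate, so the 2x SSC wash used above would be about 0.3 M NaCl and 0.03 M citrate.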

Plasmid DNA was prepared from positive plaques 
using the Bluescript system (Stratagene Cloning Systems). 
The plasmid DNA was subjected to restriction analysis and 
Southern blot analysis (Sambrook et al., ibid., which is 
incorporated herein by reference). Two clones, SN23, 
derived from the total rat brain library, and SR2, 
derived from the rat cerebellum library, were identified 
as being different from the 45-A clone and were 
sequenced. Sequence analysis showed that they 
represented two additional subtypes. SN23 encodes 
subtype 1b, which contains an additional 85 bp exon that 
encodes a new stretch of 20 amino acids and a stop codon 
in the intracellular domain and is 292 amino acids shorter 
than the 45-A clone. The nucleotide sequence and deduced 
amino acid sequence of clone SN23 are shown in Fig. 7. 
SR2 was found to contain a partial cDNA sequence encoding 
subtype 2a, which is a novel sequence that shares 42% 
homology with the transmembrane domains and extracellular 
domain of the 45-A clone. 
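
The relationship between a cDNA sequence and its deduced amino acid sequence, including termination at a stop codon, can be illustrated with a short translation routine. This is a minimal sketch using the standard genetic code; the function name is hypothetical, and the example codons are the final coding codons of the subtype 2a sequence as they are printed in the figure sheets of this text.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA reading frame (one-letter code), stopping at a stop codon."""
    seq = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":  # stop codon: translation ends here
            break
        protein.append(residue)
    return "".join(protein)


# Last coding codons of the subtype 2a cDNA, followed by the TAG stop codon
# and the first 3' non-translated bases, as printed in the figure.
SUBTYPE_2A_END = "AAA CAG ACC TAC GTC ACC TAC ACC AAC CAT GCC ATC TAG CCG GGC C"

if __name__ == "__main__":
    print(translate(SUBTYPE_2A_END))
```

The same arithmetic explains why an 85 bp exon can encode 20 amino acids and a stop codon: 21 codons occupy 63 bases, leaving the rest of the exon as untranslated sequence.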

A complete subtype 2a clone was obtained by 
rescreening both libraries as described above with the 
radiolabeled 1.3 kb Pst I fragment from clone 45-A and a 
radiolabeled 1.4 kb Eco RI-Pvu II fragment from SR2. Two 
additional clones were obtained. SN30, derived from the 
total rat brain library, contained the entire subtype 2a 
coding sequence. The nucleotide sequence and deduced 
amino acid sequence of clone SN30 are shown in Fig. 8. 
SR13, derived from the rat cerebellum library, contained 
an incomplete sequence of a new receptor subtype, 2b. 
Sequence analysis of SR13 showed that the coding sequence 
was incomplete at the 3' end and was virtually identical 
to the SN30 sequence except that it contained a 610 base 
pair deletion within the 3' terminus of SN30. The DNA 
sequence of the cDNA insert in clone SR13 is shown in 
Figure 9. 

The complete 3' end of the subtype 2a clone was 
generated using PCR amplification and an oligonucleotide 
containing a sequence unique to SR13 (ZC4520, Table 4) 
and an oligonucleotide corresponding to a sequence near 
the 3' end of the 3' non-translated region of SN30 
(ZC4519, Table 4). DNA was prepared from plate lysates 
of the original plating of each library. Each plate 
produced a pool of clones. For the PCR reactions, ten 
nanograms from each library and 100 pmol of each 
oligonucleotide were combined in a reaction volume of 50 
µl containing 50 mM KCl, 10 mM Tris-HCl pH 9.0, 1.5 mM 
MgCl2, 0.1% Triton X-100, 0.01% gelatin, 0.2 mM each 
deoxynucleotide triphosphate and 2.5 units of Thermus 
aquaticus (Taq) DNA polymerase (Promega Corporation, 
Madison, WI). The reaction mixture was overlaid with 
mineral oil. After five cycles (30 seconds at 94°C, 30 
seconds at 45°C, 1 minute at 50°C) and twenty-five cycles 
(30 seconds at 94°C, 30 seconds at 45°C, 1 minute at 
72°C) the amplified DNA was removed for analysis. 
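
Because ZC4519 corresponds to the 3' non-translated region of SN30 and is used as the downstream primer, its reverse complement should appear in the sense strand. The sketch below checks this against the last line of the SN30 sequence as it is printed in this text; the helper names are hypothetical and the check is only an illustration, not part of the described procedure.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def clean(seq):
    """Remove the spacing used in the printed sequences and upper-case."""
    return seq.replace(" ", "").upper()


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return clean(seq).translate(COMPLEMENT)[::-1]


def annealing_site(primer, sense_strand):
    """0-based index on the sense strand where an antisense primer pairs, or -1."""
    return clean(sense_strand).find(reverse_complement(primer))


# ZC4519 as listed in Table 4 (5' to 3').
ZC4519 = "TTT ATT AGA AAT GTT CTC GGT"

# Final line of the SN30 nucleotide sequence as printed in the figure.
SN30_3PRIME_END = "TTTTTATACC GAGAACATTT CTAATAAAGA TAAACACATG GTTTTGC"

if __name__ == "__main__":
    print(reverse_complement(ZC4519))
    print(annealing_site(ZC4519, SN30_3PRIME_END))
```

The primer pairs with the stretch that contains the AATAAA motif, close to the end of the printed SN30 sequence, which agrees with its description as lying near the 3' end of the non-translated region.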

Table 4 

Degenerate Oligonucleotide Primer Sequences (5' - 3') 

ZC4519   TTT ATT AGA AAT GTT CTC GGT 
ZC4520   CCT GTT CCA TAT TTT TCC ATT 
ZC4559   ATA AGA ATT CAT NKR YTT NGC YTC RTT KAA 
ZC4560   ATA AGA ATT CTT YRA YGA RAA NGG NGA YGC 
ZC4561   ATA AGA ATT CGC NGG NAT HTT YYT NKG NTA 
ZC4562   ATA AGA ATT CTA NCM NAR RAA DAT NCC NGC 
ZC4563   ATA AGA AAT CAN GTN GTR TAC ATN GTR AA 
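
Each degenerate entry in Table 4 stands for a family of oligonucleotides. A minimal sketch of how the family size follows from the IUPAC ambiguity codes is given below; the primer strings are copied as printed in this text, and the function name is hypothetical.

```python
# IUPAC nucleotide codes and the bases each one stands for.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "K": "GT", "M": "AC", "S": "CG", "W": "AT",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}

# Primer sequences as printed in Table 4.
PRIMERS = {
    "ZC4519": "TTT ATT AGA AAT GTT CTC GGT",
    "ZC4520": "CCT GTT CCA TAT TTT TCC ATT",
    "ZC4559": "ATA AGA ATT CAT NKR YTT NGC YTC RTT KAA",
    "ZC4560": "ATA AGA ATT CTT YRA YGA RAA NGG NGA YGC",
    "ZC4561": "ATA AGA ATT CGC NGG NAT HTT YYT NKG NTA",
    "ZC4562": "ATA AGA ATT CTA NCM NAR RAA DAT NCC NGC",
    "ZC4563": "ATA AGA AAT CAN GTN GTR TAC ATN GTR AA",
}


def degeneracy(primer):
    """Number of distinct sequences represented by a degenerate primer."""
    count = 1
    for base in primer.replace(" ", "").upper():
        count *= len(IUPAC[base])
    return count


if __name__ == "__main__":
    for name, sequence in PRIMERS.items():
        print(name, degeneracy(sequence))
```

The two non-degenerate primers each represent a single sequence, while the degenerate pools range from a few hundred to several thousand members, which is why 100 pmol of a pool supplies far less of any one exact sequence than 100 pmol of a unique primer.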

An aliquot from each reaction was electrophoresed 
on agarose and transferred to nitrocellulose for Southern 
analysis. Southern analysis of the PCR products showed 
that a 460 bp fragment corresponding to the 3' end of the 
2b sequence was present in several pools. One of the 
pools that produced the correct size PCR product encoding 
the 3' sequence of the 2b subtype was diluted and 
screened with radiolabeled ZC4519 and ZC4520 (Table 4). 
Phage that hybridize to both radiolabeled ZC4519 and 
ZC4520 are picked, eluted, diluted, plated and rescreened 
with the oligonucleotide probes. The screening is 
repeated until a pure clone is obtained. The pure clone 
is sequenced, and a full-length clone is constructed 
using the most convenient restriction enzyme(s). 

Based on an alignment of the deduced amino acid 
sequences of subtypes 1a and 2a, strategies were designed 
for cloning additional subtypes using PCR amplification. 
Degenerate oligonucleotide families were prepared to 
encode conserved amino acid sequences in the sixth 
transmembrane domain, a region surrounding the conserved 
amino acid sequence Phe-Asp-Glu-Lys, the third 
cytoplasmic loop, and the second transmembrane domain 
(Table 4). 

Glutamate receptor cDNA sequences were amplified 
with pairs of degenerate primers from Table 4 using the 
PCR method on cDNA from the total rat brain library, the 
cDNA from the rat cerebellum library, a rat cortex cDNA 
library or a rat hippocampus cDNA library (both obtained 
from Michael Brownstein, National Institutes of Health, 
Bethesda, MD). The primers also each contained a 5' tail 
of 10 nucleotides, which provided convenient restriction 
enzyme sites. For each PCR reaction, ten nanograms from 
the library and 100 pmol of the oligonucleotide pools 
ZC4563 and ZC4560 (Table 4) were combined in a reaction 
volume of 50 µl containing 50 mM KCl, 10 mM Tris-HCl pH 
9.0, 1.5 mM MgCl2, 0.1% Triton X-100, 0.01% gelatin, 0.2 
mM each deoxynucleotide triphosphate and 2.5 units of Taq 
DNA polymerase. The reaction mixture was overlaid with 
mineral oil. After five cycles (30 seconds at 94°C, 30 
seconds at 45°C, 1 minute at 50°C) and twenty-five cycles 
(30 seconds at 94°C, 30 seconds at 45°C, 1 minute at 
72°C) the amplified DNA was removed for analysis. 
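
The two-stage cycling program and the primer amounts described above can be written down as data, which makes quantities such as the total programmed hold time and the primer concentration easy to check. This is an illustrative sketch only; the class and function names are hypothetical, and ramp times between temperatures are ignored.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    temp_c: int
    seconds: int


@dataclass(frozen=True)
class Stage:
    cycles: int
    steps: tuple


# Five cycles with a 50 degree third step, then twenty-five with a 72 degree step.
PROGRAM = (
    Stage(5, (Step(94, 30), Step(45, 30), Step(50, 60))),
    Stage(25, (Step(94, 30), Step(45, 30), Step(72, 60))),
)


def total_cycles(program):
    """Total number of cycles across all stages."""
    return sum(stage.cycles for stage in program)


def hold_seconds(program):
    """Total programmed hold time, excluding temperature ramps."""
    return sum(
        stage.cycles * sum(step.seconds for step in stage.steps)
        for stage in program
    )


def primer_micromolar(pmol, reaction_volume_ul):
    """Primer concentration in micromolar (pmol per microliter equals micromolar)."""
    return pmol / reaction_volume_ul


if __name__ == "__main__":
    print(total_cycles(PROGRAM), "cycles")
    print(hold_seconds(PROGRAM) / 60, "minutes of programmed holds")
    print(primer_micromolar(100, 50), "micromolar primer")
```

With the values given in the text, this amounts to 30 cycles, one hour of programmed holds, and 2 micromolar of each primer or primer pool in the 50 µl reaction.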

An aliquot from each reaction was electrophoresed 
on an agarose gel. Southern analysis of the gel was 
performed using essentially the method described by 
Sambrook et al. (ibid.) and random-primed fragments 
covering the entire coding regions from both the subtype 
1a and 2a clones. The autoradiographs showed that the 
PCR reaction generated fragments of novel size that were 
different from either the 1a or 2a subtype. The 
PCR-generated fragments were electrophoresed on an 
agarose gel. Regions corresponding to the unique-sized 
receptor-related products were excised and 
electrophoresed onto NA45 paper (Schleicher and Schuell, 
Keene, NH). The purified fragments were recovered using 
essentially the method described by the manufacturer, 
digested with Eco RI and ligated to plasmid pVEGT' that 
had been linearized by digestion with Eco RI and treated 
with phosphatase to prevent recircularization. The 
ligation mixtures were transformed into E. coli strain 
DH10b cells. Transformants were picked and replica 
plated onto nitrocellulose filters and screened using 
random-primed probes from the 1a and the 2a clones. 
Forty-eight colonies were picked for restriction analysis 
and sequencing. 

WO 92/10583 PCT/US91/09422 

DNA sequences from the cDNA from the total rat 
brain library and the cDNA from the rat cerebellum 
library were each amplified and analyzed using the 
methods described above and oligonucleotide ZC4559 in 
combination with either ZC4561 or ZC4559 (Table 4). 

A rat cortex cDNA library and a rat hippocampus 
cDNA library (both obtained from Michael Brownstein, NIH) 
are subdivided into 30 pools of 10,000 colonies. Plasmid 
DNA is prepared from each pool, and the DNA is subjected 
to Southern analysis after restriction digestion of the 
pools with Bam HI and Xho I or by PCR amplification of 
each pool using the degenerate oligonucleotides of Table 
4. The library pools containing DNA that hybridizes to 
the probes and appears to contain a full-length cDNA are 
subdivided. The plasmid DNA is prepared and screened as 
described above. Positive pools are again divided and 
the procedure is continued until the pool is reduced to 
pure clones. The clones are subjected to restriction 
analysis and partial sequence analysis. Clones that 
represent distinct glutamate receptor homologs are 
completely sequenced. Full length clones are generated 
by subjecting the original pools to PCR amplification 
using an oligonucleotide primer specific to the SP6 
promoter at the 5' end of the cDNA insert and an 
antisense oligonucleotide primer corresponding to the 5' 
end of the most complete cDNA to identify pools that 
contain the longest glutamate receptor homolog cDNA. The 
pool is then diluted and rehybridized with the probes as 
described above to isolate a full length cDNA clone. 
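
The pool-subdivision strategy described above can be sized with a few lines of arithmetic. The text gives 30 pools of 10,000 colonies but does not say how finely each positive pool is split, so the split factor of 10 used below is purely an assumption for illustration, as are the function names.

```python
def library_size(pools, colonies_per_pool):
    """Total number of colonies represented across all pools."""
    return pools * colonies_per_pool


def rounds_to_pure(pool_size, split):
    """Rounds of subdivision needed to reduce one positive pool to a single clone."""
    if split < 2:
        raise ValueError("split must be at least 2")
    rounds = 0
    while pool_size > 1:
        pool_size = -(-pool_size // split)  # ceiling division
        rounds += 1
    return rounds


def pools_screened(initial_pools, pool_size, split):
    """Pools screened when one positive pool is followed through every round."""
    return initial_pools + split * rounds_to_pure(pool_size, split)


if __name__ == "__main__":
    # Assumed split factor: each positive pool is divided into 10 sub-pools.
    print(library_size(30, 10_000))
    print(rounds_to_pure(10_000, 10))
    print(pools_screened(30, 10_000, 10))
```

Under that assumption, a single positive clone among 300,000 colonies would be reached after four rounds of subdivision and 70 pool screenings in total.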

Expression of Glutamate Receptor Subtypes 
Complementary DNA sequences encoding subtypes 1b 
and 2a were subcloned first into the mammalian expression 
vector Zem228R to obtain convenient terminal restriction 
sites. The cDNAs were then subcloned into pVEGT'. The 
cDNA sequence encoding subtype 1b was constructed by 
replacing the 3' terminal portion of subtype 1a described 
in Example I with the analogous portion of subtype 1b 
from SN23. Plasmid SN23 was digested with Kpn I and Xba 
I to isolate the fragment containing the 3' terminus of 
the 1b subtype. The plasmid containing the subtype 1a 
coding sequence (45-A) in Zem228R was digested with Kpn I 
and Xba I to isolate the vector-containing fragment. The 
vector-containing fragment was ligated to the Kpn I-Xba I 
fragment from SN23. The resulting plasmid comprises the 
MT-1 promoter, the subtype 1b cDNA and the hGH 
terminator. This plasmid was transfected into the BHK 
570 cell line essentially as described in Example I to 
obtain stably transfected cell lines expressing the 
subtype 1b receptor. The subtype 1b cDNA fragment was 
isolated as a Bam HI fragment, which was ligated with 
pVEGT' that had been linearized with Bam HI. A plasmid 
containing the cDNA sequence in the correct orientation 
was used to synthesize RNA in an in vitro system. The 
RNA was injected into oocytes as described above. 

Plasmid SN30, which comprises the subtype 2a cDNA, 
was digested with Eco RI to isolate the subtype 2a cDNA. 
The Eco RI fragment was ligated with Eco RI-linearized 
Zem228R. A plasmid containing the insert in the correct 
orientation was digested with Bam HI to isolate the cDNA 
sequence. The Bam HI fragment comprising the subtype 2a 
cDNA was ligated with Eco RI-linearized pVEGT'. A 
plasmid containing the cDNA in the correct orientation 
was used to synthesize RNA in an in vitro translation. 
The RNA was injected into frog oocytes as described 
above. 

EXAMPLE III 

Generation of antibodies to glutamate receptor subtypes 
Receptor subtype-specific polyclonal antisera 
were generated in rabbits using standard immunization 
techniques. Synthetic peptides (Table 5) were designed 
from the cloned receptor sequences. The peptides were 
conjugated to keyhole limpet hemocyanin, and each antigen 
was used to immunize two animals. For each peptide, the 
animals were injected with 100-200 µg of conjugated 
peptide divided among three subcutaneous sites. The 
animals were immunized at three-week intervals and bled 
via an ear vein 10 days after the third and subsequent 
immunizations. 
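
The immunization and bleed timing described above can be laid out as day offsets from the first injection. The sketch below simply encodes the stated intervals (three weeks between immunizations, bleeds 10 days after the third and each later immunization); the function name and the choice of five immunizations in the example are assumptions for illustration.

```python
def schedule(n_immunizations, interval_days=21, bleed_delay_days=10, first_bled=3):
    """Return (immunization days, bleed days) counted from the first injection.

    Bleeds follow the `first_bled`-th immunization and every later one.
    """
    immunizations = [i * interval_days for i in range(n_immunizations)]
    bleeds = [day + bleed_delay_days for day in immunizations[first_bled - 1:]]
    return immunizations, bleeds


if __name__ == "__main__":
    injections, bleeds = schedule(5)
    print("immunize on days:", injections)
    print("bleed on days:", bleeds)
```

With five immunizations this places the first bleed on day 52, ten days after the third injection on day 42.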







Table 5 

Subtype   Seq. ID No.   Peptide Sequence      Apparent Location 

1a        21            RDSLISIRDEKDGLNRC     extracellular 
          22            DRLLRKLRERLPKARV      extracellular 
          23            EEVWFDEKGDAPGRYD      extracellular 
          24            EFVYEREGNTEEDEL       cytoplasmic 
          25            PERKCCEIREQYGIQRV     extracellular 
          26            IGPGSSSVAIQVQNLL      extracellular 
          27            IAYSATSIDLSDKTL       extracellular 
1b        28            KKPGAGNAKKRQPEFS      cytoplasmic 
          29            PEFSPSSQCPSAHAQL      cytoplasmic 
2a        30            DKIIKRLLETSNARG       extracellular 
          31            VNFSGIAGNPVTFNEN      extracellular 
          32            GEAKSELCENLETPAL      cytoplasmic 
2b        33            PARLALPANDTEFSAWV     cytoplasmic 

Anti-peptide antibodies were purified by affinity 
purification using the Proton™ Kit (Multiple Peptide 
Systems, San Diego, CA). Purified antibodies were stored 
in column elution buffer and neutralizing buffer 
(supplied by Multiple Peptide Systems). Bovine serum 
albumin was added to a concentration of 1 mg/ml, and 
sodium azide was added to a concentration of 0.05%. The 
antibodies were stored at 4°C or in small aliquots at 
-20°C. 

Antibodies generated from the peptides listed in 
Table 6 were used to detect G protein-coupled glutamate 
receptors by Western blot analysis of membranes prepared 
from transfected cell lines that were stably expressing 
the subtype 1a or subtype 1b receptors. Control cell 
lines were transfected with vector alone. 



Table 6 

Analysis of Antibodies Raised to Peptides 

Antibodies to 
Peptide Sequence      Seq. ID No.   Location        Western 

RDSLISIRDEKDGLNRC     21            extracellular   +++ with bkgd 
DRLLRKLRERLPKARV      22            extracellular   + 
EEVWFDEKGDAPGRYD      23            extracellular   ++++ low bkgd 
EFVYEREGNTEEDEL       24            cytoplasmic     [illegible] low bkgd 
KKPGAGNAKKRQPEFS      28            cytoplasmic     + for 1a, - for 1b 
PEFSPSSQCPSAHAQL      29            cytoplasmic     +++ for 1b, low bkgd 



Transfectants that were stably expressing either 
the 1a or 1b subtype were each grown to confluency in 
five to ten 150 mm plates. Each plate was first washed 
twice with 15 ml of cold PBS and then 20 ml of ice cold 
10 mM NaHCO3 was added to each plate. The cells from 
each plate were scraped off the plates with a rubber 
spatula and transferred to a glass dounce homogenizer on 
ice. The cells were disrupted with ten strokes of the B 
pestle. The homogenates from each plate were combined 
and centrifuged for thirty minutes at 3000 rpm at 4°C. 
The pellets were resuspended in 4-8 ml of 10 mM NaHCO3, 
using a 22 g needle and syringe, and 69% sucrose was 
added (6-12 ml) to each sample until an index of 
refraction of 1.410 was reached. The samples were 
transferred to a high speed centrifugation tube, and each 
sample was overlaid with 42% sucrose. The samples were 
centrifuged for two hours at 25,000 rpm at 4°C. The 
samples were collected by gently floating the membranes 
off the 42% sucrose layer by adding 1 ml of 10 mM NaHCO3 
and resuspending the membranes by carefully stirring the 
upper layer. The upper layer was transferred to a fresh 
tube on ice. The purified membranes were centrifuged at 
10,000 rpm at 4°C and the pellets resuspended in 10 mM 
NaHCO3. The purified membranes were then adjusted to a 
final protein concentration of 1-2 µg/ml. 

Ten to twenty micrograms of each purified membrane 
preparation were diluted with 2x SDS-mercaptoethanol 
buffer (100 mM Tris HCl (pH 6.8), 200 mM dithiothreitol, 
4% SDS, 0.2% bromophenol blue, 20% glycerol). The 
samples were incubated for 15 minutes at 37°C followed by 
boiling for 5 minutes. The samples were subjected to 
SDS-PAGE on a 4-15% gradient gel. The samples were 
electrotransferred to nitrocellulose using essentially 
the method described by Towbin (Proc. Natl. Acad. Sci. 
USA 76: 4350-4354, 1979; which is incorporated herein by 
reference in its entirety). After transfer, the 
nitrocellulose was cut into strips such that each strip 
contained a control and receptor samples. The 
nitrocellulose was preincubated in blocking buffer and 
then incubated with a dilution of either the preimmune 
serum or the serum collected after antigenic stimulation 
(sera from later bleeds, i.e. those after four antigen 
stimulations, were diluted 1:1500). After washing, a 
horseradish peroxidase-conjugated goat anti-rabbit 
antibody (Bio-Rad Laboratories, Richmond, CA) diluted 
1:2,500 was added and, after incubation and washing, the 
horseradish peroxidase substrate (Bio-Rad Laboratories) 
was added and the color reaction was initiated. The 
reaction was stopped by rinsing the filters in distilled 
water. Table 6 shows the results of the Western blot 
analysis. 
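
The dilutions in the Western blot procedure reduce to simple proportional arithmetic. The sketch below shows the final concentrations when a sample is mixed with the 2x buffer, and the volume of serum needed for a 1:1500 dilution. It assumes that the sample is mixed with an equal volume of 2x buffer and that "1:1500" means one part serum in 1500 parts total; the function names and the 15 ml incubation volume are hypothetical.

```python
# Composition of the 2x sample buffer as given in the text.
TWO_X_BUFFER = {
    "Tris-HCl pH 6.8 (mM)": 100.0,
    "dithiothreitol (mM)": 200.0,
    "SDS (%)": 4.0,
    "bromophenol blue (%)": 0.2,
    "glycerol (%)": 20.0,
}


def final_concentrations(buffer, sample_volume, buffer_volume):
    """Concentration of each buffer component after mixing with the sample."""
    fraction = buffer_volume / (sample_volume + buffer_volume)
    return {name: value * fraction for name, value in buffer.items()}


def serum_volume_ul(total_volume_ml, dilution):
    """Microliters of serum for a 1:dilution mix (one part in `dilution` parts total)."""
    return total_volume_ml * 1000.0 / dilution


if __name__ == "__main__":
    # Equal volumes of sample and 2x buffer (assumed) give a 1x final buffer.
    for name, value in final_concentrations(TWO_X_BUFFER, 10.0, 10.0).items():
        print(name, value)
    # Serum needed for a hypothetical 15 ml incubation at 1:1500.
    print(serum_volume_ul(15.0, 1500), "microliters of serum")
```

Mixed one to one, the samples are loaded in 50 mM Tris-HCl, 100 mM dithiothreitol, 2% SDS, 0.1% bromophenol blue and 10% glycerol.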
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Mulvihill, Eileen R. 
Hagen, Frederick S. 
Houamed, Khaled M. 
Almers, Wolfhard 

(ii) TITLE OF INVENTION: G PROTEIN-COUPLED GLUTAMATE RECEPTORS 
(iii) NUMBER OF SEQUENCES: 33 

(iv) CORRESPONDENCE ADDRESS: 
(A) ADDRESSEE: Townsend and Townsend 
(B) STREET: One Market Plaza, Steuart Street Tower 
(C) CITY: San Francisco 
(D) STATE: California 
(E) COUNTRY: USA 
(F) ZIP: 94105-1492 

(v) COMPUTER READABLE FORM: 
(A) MEDIUM TYPE: Floppy disk 
(B) COMPUTER: IBM PC compatible 
(C) OPERATING SYSTEM: PC-DOS/MS-DOS 
(D) SOFTWARE: PatentIn Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 
(B) FILING DATE: 

(2) INFORMATION FOR SEQ ID NO:1: 
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95 



100 105 



TCC TGC TGG CAC TCT TCA GTG GCT CTC GAA CAG AGC ATC GAA TTC ATC 
llr S?S Trt Ss ier Ser val Ala Leu Glu Gin Ser II; Glu Phe lie 
110 115 XrfW 

... r « e mce CTG ATT TCC ATC CGA GAT GAG AAG GAT. GGG CTG AAC CGA 
Ar5 Sp IS 2S S sS ne Arg Asp Glu Lys Asp Gly Leu Asn Arg 

125 I 30 135 

TGC CTG CCT GAT GGC CAG ACC CTG CCC CCT GGC AGG ACT AAG AAG CCT 
Ifs Su S Sp Gly Gin Thr l*u Pro Pro Gly Arg Thr Lys Lys Pro 
140 1^5 150 



ATT GCT GGA GTG ATC GGC CCT GGC TCC AGC TCT GTG GCC ATT CAA GTC 
lie Ala Gly Val lie Gly Pro Gly Ser Ser Ser Val Ala lie Gin val 

. 160 l fi5 
CAG AAT CTT CTC CAG CTG TTC GAC ATC CCA CAG ATC GCC TAT TCT GCC 

Asn 5u 2S Sn Leu Phe Asp lie Pro Gin lie Ala Tyr Ser Ala 
175 I 80 ' 

ACA AGC ATA GAC CTG ACT GAC AAA ACT TTG TAC AAA TAC TTC CTG AGG 

tS sir S 2p Su ser Asp Lys Thr Leu Tyr Lys Tyr Phe Leu Arg 

190 ■■ 195- ■ < .*V.y . 

CTG GTC CCT TCT GAC ACT TTG CAG GCA AGG GCG ATG CTC GAC ATA GTC 
SS S5 S.52 Sp Jhr Leu Gin Ala Arg Ala Met Leu Asp He Val 
205 210 215 

«=T TAC AAC TGG ACC TAT GTC TCA GCA GTC CAC ACA GAA GGG AAT 

iyS S A^S f£ £r Tyr Val Ser Ala Val His Thr Glu Gly Asn 

220 225 . 230 

S5 SEE £ S S SSSM.S S SS - 

2 4 0 

GGC CTC TGC ATC GCA CAC TCG GAC AAA ATC TAC AGC AAT GCT GGC GAG 
SS.'S Ue Si His Ser Asp Lys lie Tyr Ser Asn Ala Gly Glu 

jusc TTT GAC CGG CTC CTG CGT AAA CTC CGG GAG CGG CTT CCC AAG 
iyl sir Sp £ 2u Leu Arg Lys Leu Arg Glu Arg Leu Pro Lys 

270 275 
GCC AGG GTT GTG GTC TGC TTC TGC GAG GGC ATG ACA GTG CGG GGC TTA 

Ala Arg val val val cys Phe Cys Glu Gly Met Tlu vai A*g <-.y i« 

285 290 

.= s s'is s sa is s sg s = - -I 

300 305 

GGA AGT GAT GGA TGG GCA GAC AGA GAT GAA GTC ATC GAA GGC TAT GAG 
Gly Ser Asp Gly Trp Ala Asp Arg Asp Glu val ne eiu ^ 

320 323 
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841 



889 



937 



985 



1033 



1081 



1129 



1177 



1225 



1273 



1321 



1369 
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s a s a a sisa g-ss ss a a s 



335 340 



5 12 a S ■ S SS. a S £ a S SS 3 & - 22. .3 

350 355 

as as aa sssas i a ss aa 

365 370 

aa a a a a 'a a a a a a -a a a s 



380 

:?s a a a a a a a a a a a .a a - a a ■ 
sss'ss jr. a' SkS^ a. a ga a 

430 435 

a s a a aa a ass a as a a a a 
Wis £ aa I as a a a a a sag 



460 

CCC £ ATT » »T «. CJG T»C g £ 

Pro Gly Arg Tyr Asp lie Met Asn wa 490 

480 4HD 

a ; aasraa aaaa saigas 

495 

aaa a aa aga s * .= i - a s 

510 515 

a a a a a a a - a a ? ■ a a.s a a a 

525 530 
GAA GTG AGC TGC TGC TGG ATC TGC ACG GCC TGC AAA GAG AAT GAG TTT 
Glu Val Ser Cys cys Trp lie Cys Thr Ala cys ugm * ^ 
540 545 

GTG CAG GAC GAG TTC ACC TGC AGA GCC TGT GAC CTG GGG TGG TGG CCC 




1417 



1465 



1513 



1561 



1609 



1657, 



1705 



1753 



1801 



1849 



1897 



1945 



1993 



2041 



2089 
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Vai Gin Asp Glu Phe Thr cys Arg Ala cys Asp Leu Gly Trp Trp Pro 

560 565 50 



AAC GCA GAG CTC ACA GGC TGT. GAG CCC ATT CCT GTC CGT TAT CTT GAG 2137 
Asn Ala Glu Leu Thr Gly Cys Glu Pro lie Pro Val Arg Tyr Leu Glu 
575 580 585 

TGG AGT GAC ATA GAA TCT ATC ATA GCC ATC GCC TTT TCT TGC CTG GGC 2185 
Sp Ser Asp lie Glu Ser lie lie Ala lie Ala Phe Ser Cys Leu Gly 
590 595 600 

ATC CTC GTG ACG CTG TTT GTC ACC CTC ATC TTC GTT CTG TAC CGG GAC 2233 
lie Leu Sal Thr Leu Phe Val Thr Leu lie Phe Val Leu Tyr Arg Asp 
605 610 615 

ACA CCC GTG GTC AAA TCC TCC AGT AGG GAG CTC TGC TAT ATC ATT CTG 2281 
Thr Pro val Val Lys Ser ser Ser Arg Glu Leu cys Tyr He lie Leu 
-,20 625 630 635 

OCT GGT ATT TTC CTC GGC TAT GTG TGC CCT TTC ACC CTC ATC GCC AAA 2329 
Ala Gly lie Phe Leu Gly Tyr val cys Pro Phe Thr Leu He Ala Lys 

6 40 645 650 

CCT ACT ACC ACA TCC TGC TAC CTC CAG CGC CTC . CTA GTT GGC CTC Tg 2377 
Pro Thr Thr Thr Ser cys Tyr Leu Gin Arg Leu Leu Val Gly Leu. Ser. 
655 660 565 

TCT GCC ATG TGC TAC TCT GCT TTA GTG ACC AAA ACC AAT CGT ATT GCA 
Sr Sa Se? cjs ?yr Ser Ala Leu Val Thr Lys Thr Asn Arg He Ala 
670 i 675 680 

CGC ATC CTG GCT GGC AGC AAG AAG AAG ATC TGC ACC CGG AAG CCC AGA 
Sg S 2S Gly Ser Lys Lys Lys lie Cys Thr Arg Lys Pro Arg 
685 1 690 695 

TTC ATG AGC GCT TGG GCC CAA GTG ATC ATA GCC TCC ATT CTG ATT AGT 
Se 5e? sS S.S'AU Gin Val lie lie Ala Ser lie Leu lie Ser 
700 ! 705 710 /13 

rir . CTA A CA CPA GTG GTG ACC TTG ATC ATC ATG GAG CCT CCC ATG 
SS Sn Su Th^ SS Si Si Thr Leu lie He Met Glu Pro Pro Met 

720 725 



CCC ATT TTG TCC TAC CCG AGT ATC AAG 'GAA GTC TAC CTT ATC TGC AAT 
no He Leu sir Tyr Pro Ser He Lys Glu Val Tyr Leu He Cys Asn 
735 740 

&CC AGC AAC CTG GGT GTA GTG GCC CCT GTG GGT TAC AAT GGA CTC CTC 
tS sir £n Su 2? Val Val Ala Pro Val Gly Tyr Asn Gly Leu Leu 

750 I 755 
x „ . rr T 1. T ACC TAC TAT GCC TTC AAG ACC CGC AAC GTG CCG GCC 

SS mS sir C?s Thr Sr Ala Phe Lys Thr Arg Asn val Pro Ala 
765 j 770 775 

S-SK5 & S £ S ff E ST" S S SSS5 
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78C- 785 790 795 

ATC ATC TGG CTG GCT TTC GTT CCC ATT TAC TTT GGG AGC AAC TAC AAG 2809 
Ile Ile Trp Leu Ala Phe Val Pro Ile Tyr Phe Gly Ser Asn Tyr Lys 
800 805 810 

ATC ATC ACT ACC TGC TTC GCG GTG AGC CTC AGT GTG ACG GTG GCC CTG 2857 
Ile Ile Thr Thr Cys Phe Ala Val Ser Leu Ser Val Thr Val Ala Leu 
815 820 825 

GGG TGC ATG TTT ACT CCG AAG ATG TAC ATC ATC ATT GCC AAA CCT GAG 2905 
Gly Cys Met Phe Thr Pro Lys Met Tyr Ile Ile Ile Ala Lys Pro Glu 
830 835 840 

AGG AAC GTC CGC AGT GCC TTC ACG ACC TCT GAT GTT GTC CGC ATG CAC 2953 
Arg Asn Val Arg Ser Ala Phe Thr Thr Ser Asp Val Val Arg Met His 
845 850 855 

GTC GGT GAT GGC AAA CTG CCG TGC CGC TCC AAC ACC TTC CTC AAC ATT 3001 
Val Gly Asp Gly Lys Leu Pro Cys Arg Ser Asn Thr Phe Leu Asn Ile 
860 865 870 875 

TTC CGG AGA AAG AAG CCC GGG GCA GGG AAT GCC AAT TCT AAC GGC AAG 3049 
Phe Arg Arg Lys Lys Pro Gly Ala Gly Asn Ala Asn Ser Asn Gly Lys 
880 885 890 

TCT GTG TCA TGG TCT GAA CCA GGT GGA AGA CAG GCG CCC AAG GGA CAG 3097 
Ser Val Ser Trp Ser Glu Pro Gly Gly Arg Gln Ala Pro Lys Gly Gln 
895 900 905 

CAC GTG TGG CAG CGC CTC TCT GTG CAC GTG AAG ACC AAC GAG ACG GCC 3145 
His Val Trp Gln Arg Leu Ser Val His Val Lys Thr Asn Glu Thr Ala 
910 915 920 

TCT AAC CAA ACA GCC GTA ATC AAA CCC CTC ACT AAA AGT TAC CAA GGC 3193 
Cys Asn Gln Thr Ala Val Ile Lys Pro Leu Thr Lys Ser Tyr Gln Gly 
925 930 935 

TCT GGC AAG AGC CTG ACC TTT TCA GAT GCC AGC ACC AAG ACC CTT TAC 3241 
Ser Gly Lys Ser Leu Thr Phe Ser Asp Ala Ser Thr Lys Thr Leu Tyr 
940 945 950 955 

AAT CTG GAA GAA GAG GAC AAT ACC CCT TCT GCT CAC TTC AGC CCT CCC 3289 
Asn Val Glu Glu Glu Asp Asn Thr Pro Ser Ala His Phe Ser Pro Pro 
960 965 970 

AGC AGC CCT TCT ATG GTG GTG CAC CGA CGC GGG CCA CCC GTG GCC ACC 3337 
Ser Ser Pro Ser Met Val Val His Arg Arg Gly Pro Pro Val Ala Thr 
975 980 985 

ACA CCA CCT CTG CCA CCC CAT CTG ACC GCA GAA GAG ACC CCC CTG TTC 3385 
Thr Pro Pro Leu Pro Pro His Leu Thr Ala Glu Glu Thr Pro Leu Phe 
990 995 1000 

CTG GCT GAT TCC GTC ATC CCC AAG GGC TTG CCT CCT CCT CTC CCG CAG 3433 
Leu Ala Asp Ser Val Ile Pro Lys Gly Leu Pro Pro Pro Leu Pro Gln 
1005 1010 1015 
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(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1199 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

„et val Arg Leu Uu Uu He Ph. Ph. Pro Met lie Phe Leu Glu Met . 

1 5 ■ 

S.r II. 'im Pro AT, H.t Pro A* Ar, I*. V.1 fm L.U Ala «, 
s.r M r «1 Al. at, K. E A.P «» «* « S * 

io. lj. Phi «r »i Hi. Hi. Gin Pro Pro. Al;. W. «» » 6l » . . 
„, * cy. W =1U II. AT, «lu 8 ln.Tyr «y XI. «ln Ar, Clu 
*H W Ph. HI. m U» «P * II. A.; »1. "P Pro «1 ^ 

Pro a» ii. Thr uu «iy s« « g; »o s « <*■ -S Hi " *" 

100 

s« vi Ai. l.u «m s .r u. «u ph. i« m »P «• 

115 lzg 

,,r ii. at, ^^:. {8 -«ir;^^«ss:«r . . ; 

Olh X UU. Pro Pro ? ly Ar, Thr L». ty. Pro . II. Al. Oly V«l 11; 

145 ^ 150 

8l y Pr. «ly s« s« s„ v.i Al. il. W«*>- 

165 

„u p„. A.p II. Pro «ln II. Al. Tyr S.r Al. Thr s« II. A.p 

180 1 , 

3«r Asp Iy. Thr L~ Tyr Iy.Tyr.Ph. U. Ar, v.l v.l;Pro s« «P 

195 * U " m ' 

L» Cl» Al. Ar, Al ? H.t «. A.P II v.l Ly. Ar, Tyr A.n Trp 

210 215 

^ v.1 s r Al. v.l Hi. Th, clu «iy »h Tyr «y «iu S« « ? 
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Met Asp Ala Phe Lys Glu Leu Ala Ala Gin Glu Gly Leu Cys lie Ala 

245 250 

His ser Asp Lys lie Tyr Ser Asn Ala Gly Glu Lys Ser Phe Asp Arg 

260 265 ' * : 

Leu Leu Arg Lys Leu Arg Glu Arg Leu Pro Lys Ala Arg Val Val Val ' 

275 280 285 .• 

cys Phe cys Glu Gly Met Thr val Arg Gly Leu Leu Ser Ala Met Arg 

290 295 300 

Arg Leu Gly Val Val Gly Glu Phe Ser Leu lie Gly Ser Asp Gly Trp 

305 310 315 , 

Ala Asp Arg Asp Glu Val lie Glu Gly Tyr Glu Val Glu Ala Asn Gly 

Gly lie Thr lie Lys Leu Gin Ser Pro Glu Val Arg Ser Phe Asp Asp 
340 345 

Tyr Phe Leu Lys Leu Arg Leu Asp Thr Asn Thr Arg Asn Pro Trp Phe 

355 360 

Pro Glu Phe Trp Gin His Arg Phe Gin Cys Arg Leu Pro Gly His Leu 

370 375 

Leu Glu Asn Pro Asn Phe Lys Lys Val Cys Thr Gly Asn Glu Ser Leu 
385 390 

Glu Glu Asn Tyr Val Gin Asp Ser Lys Met Gly Phe Val lie Asn Ala 

405 410 

lie Tyr Ala Met Ala His Gly Leu Gin Asn Met His His Ala Leu cys 
420 425 

Pro Gly His Val Gly Leu Cys Asp Ala Met. Lys Pro lie Asp Gly Arg 

435 

Lys Leu Leu Asp Phe Leu lie Lys Ser Ser Phe Val Gly Val ser Gly 
450 4 ^5 4oo 

Glu Glu val Trp Phe Asp Glu Lys Gly Asp Ala Pro Gly Arg Tyr Asp 

465 470 , 

lie Met Asn Leu Gin Tyr Thr Glu Ala Asn Arg Tyr Asp Tyr Val His 

485 490 . 

Val Gly Thr Trp His Glu Gly Val Leu Asn lie Asp Asp Tyr Lys lie 

Ser 

515 520 525 



500 505 
Gin Met Asn Lys Ser Gly Met val Arg Ser Val Cys Ser Glu Pr cys 



uu Lys Gly Gin II Lys val lie Arg Lys Gly Glu Val Ser cys cys 

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Trp-lie cys Thr Ala cys Lys Glu Asn Glu Phe val Gin Asp Glu Phe 



T ~ u eiv Trp Trp Pro Asn Ala Glu Leu Thr 
Thr cys Arg Ala Cys Asp Leu exy irp 575 

565; 

Gly cys Glu Pro lie Pro Val Arg Tyr Leu Glu Trp Ser Asp lie Glu 
Ser lie lie Ala lie Ala Phe Ser cys Leu Gly lie Leu Val Thr Leu 



595 



600 



Phe Val Thr Leu lie Phe Val Leu Tyr Arg Asp Thr Pro Val Val Lys 



610 



615 



Ser Ser Ser Arg Glu Leu cys Tyr lie lie Leu Ala Gly lie Phe Leu 

625 ■ 630 

<5y Tyr Val cys Pro Phe Thr Leu lie Ala Lys Pro Thr Thr Thr Ser 



645 



cys Tyr Leu Gin Arg Leu Leu Val Gly Leu Ser Ser Ala Met Cys Tyr 

660 665 
ser Ala Leu Val Thr Lys Thr Asn Arg lie Ala Arg lie Leu Ala Gly 
675 680 



s.r Ly, Ly. Lys II. cys Thr Ar, Ly. Pre Ar, Ph. .** S« Al. Trp 

690 695 
Ala Gin Val lie lie Ala Ser lie Leu lie Ser Val Gin Leu Thr Leu 



705 710 



_ T . Tlia MD> . fllu pro p r o Met Pro lie Leu Ser Tyr 
Val Val Thr Leu lie lie Met Glu pro m n ?35 

725 730 
Pro »'n. Ly. Glu v.! Tyr L.» II. Cy. A.n Thr s« ».n L« Oly 

740 745 

VI V* Ala' Pro V.1 «y Tyr ». cly Uu Uu U. M« S.r Cy. Thr 

755 —~ 760 

Tyr tyr *1. Ph. Ly. Thr Ar, A«, v.1 Pro Al. A.J Ph. A.n Glu Al. 

770 775 

Lys Tyr lie Ala Phe Thr Met Tyr Thr Thr Cys lie lie Trp Leu Ala 

785 790 

Ph.v.lPr II. Tyr Ph. Gly s« «n Tyr Ly. II II. Thr Thr cy. 
Ph. Al. v.1 s« "u s.r v.1 Thr v.1 Al. l~ civ cy. gt Ph. Thr 

820 823 

„ * ti« He He Ala Lys Pr Glu Arg Asn Val Arg Ser 
Pro Lys Met Tyr lie lie lie aa* mr» 845 

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Ala Phe Thr Thr Ser Asp Val Val Arg Met His . Val Gly Asp Gly Lys 
850 j i 855 860 

Leu Pro cys Arg Ser Asn Thr Phe Leu Asn lie Phe Arg Arg Lys Lys 
865 870 875 880 

Pro Gly Ala gW Asn Ala Asn Ser Asn Gly Lys Ser val Ser Trp Ser 

885 890 

Glu Pro Gly Gly Arg Gin Ala Pro Lys Gly Gin His val Trp Gin Arg 
900 905 9 . 

1 

Leu ser Val His val Lys Thr Asn Glu Thr Ala Cys Asn Gin Thr Ala 
915 920 925 

Val He Lys Pro Leu Thr Lys Ser Tyr Gin Gly Ser Gly Lys Ser Leu 
930 935 940 

Thr Phe Ser Asp Ala Ser Thr Lys Thr Leu Tyr Asn Val Glu Glu Glu 
945 950 955 960 

Asp Asn Thr Pro ser Ala His Phe Ser Pro Pro Ser Ser Pro Ser Met 

965 970 973 

val Val His Arg Arg Gly Pro Pro Val Ala Thr Thr Pro Pro Leu Pro 
980 985 9 90 

Pro His Leu Thr Ala Glu Glu Thr Pro Leu Phe Leu Ala Asp Ser Val 

995 1000 1005 . 

lie Pro Lys Gly Leu Pro Pro Pro Leu Pro Gin Gin Gin Pro Gin Gin 
1010 1015 1020 

Pro Pro Pro cin Gin Pro Pro Gin Gin Pro Lys Ser Leu Met Asp Gl^ 
1025 1030 1035 "40 

Leu Gin Gly Val Val Thr Asn Phe Gly Ser Gly He Pro Asp Phe His 

1045 1050 iW33 

Ala val Leu Alia Gly Pro Gly Thr Pro Gly Asn Ser Leu Arg Ser Leu 
1050 1065 107 u 

Tyr Pro Pro Prb Pro Pro Pro Gin His Leu Gin Met Leu Pro Leu His 
1075 ! I 080 1085 

Leu Ser Thr Phe Gin Glu Glu Ser lie Ser Pro fro Gly Glu Asp lie 
1090 1095 1100 

Asp Asp Asp Ser Glu Arg Phe Lys Leu Leu Gin Glu Phe Val Tyr Gl^ 

1105 1110 lixa 

Arg Glu Gly Asn Thr Glu Glu Asp. Glu Lea Glu Glu Glu Glu Asp Leu 

? 1125 1130 * L * L J 

Pro Thr Ala ijr Ly. Leu Thr Pr Glu Asp S r Pro Ala Leu Thr Pr 

1140 1145 AWW 
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Pro ser Pro Phe Arg Asp ser Val Ala ser Gly Ser serial Pro Ser 

1155 . 1160 ■ 

ser Pro Val Ser Glu ser Val Leu Cys Thr Pro ProAsn Val Thr Tyr 

1170 . 1175 
Ala Ser Val lie Leu Arg Asp Tyr Lys Gin Ser Ser Ser Thr Leu 
1185 * 1195 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(vii) IMMEDIATE SOURCE: 
(B) CLONE: ZC775 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
GCTAGCATAA CCCCTTGGGG CCTCTAAACG GGTCT 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA

(vii) IMMEDIATE SOURCE:
(B) CLONE: ZC776

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
CTCAAGACCC GTTTAGAGGC CCCAAGGGGT TATGCTAGCT GCA 
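
As printed, SEQ ID NO: 3 (ZC775) and SEQ ID NO: 4 (ZC776) appear to be largely reverse complements of one another. The following is a minimal Python sketch, not part of the listing, that checks this and reports the unpaired bases at each end of the longer oligonucleotide. The helper names are illustrative assumptions, and the interpretation of the unpaired ends as cloning overhangs is not stated in this section.

```python
# Sketch: test whether two oligonucleotides from the listing can anneal.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def clean(printed: str) -> str:
    """Remove the spaces that group bases in tens and normalise case."""
    return "".join(printed.split()).upper()


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string containing only A/C/G/T."""
    return seq.translate(COMPLEMENT)[::-1]


seq_id_3 = clean("GCTAGCATAA CCCCTTGGGG CCTCTAAACG GGTCT")
seq_id_4 = clean("CTCAAGACCC GTTTAGAGGC CCCAAGGGGT TATGCTAGCT GCA")

# The region of SEQ ID NO: 4 that would pair with all of SEQ ID NO: 3.
duplex_core = reverse_complement(seq_id_3)
start = seq_id_4.find(duplex_core)  # -1 would mean no full-length pairing

# Bases of SEQ ID NO: 4 left unpaired on either side of the duplex.
left_overhang = seq_id_4[:start]
right_overhang = seq_id_4[start + len(duplex_core):]

print(start, left_overhang, right_overhang)
```

The same check can be applied to the other oligonucleotide pairs in the listing by substituting their sequences.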
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

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(vii) IMMEDIATE SOURCE: 
(B) CLONE: ZC777



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
TGAGGGGTTT TTTGCTGAAA GGAGGAACTA TGCGGCCGCA 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: ZC778 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
AGCTTGCGGC CGCATAGTTC CTCCTTTCAG CAAAAAACCC 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(vii) IMMEDIATE SOURCE:
(B) CLONE: ZC1751

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:
AATTCTGTGC TCTGTCAAG 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(vii) IMMEDIATE SOURCE:
(B) CLONE: ZC1752

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:
GATCCTTGAC AGAGCACAG 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(vii) IMMEDIATE SOURCE:
(B) CLONE: ZC2063

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
GATCCAAACT AGTAAAAGAG CT 
(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vii) IMMEDIATE SOURCE:
(B) CLONE: ZC2064

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:
CTTTTACTAG TTTG 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(vii) IMMEDIATE SOURCE: 
(B) CLONE: ZC2938



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GACAGAGCAC AGATTCACTA GTGAGCTCTT TTTTTTTTTT TTT 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: ZC3015

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:
TTCCATGGCA CCGTCAAGGC T 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: ZC3016

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:
AGTGATGGCA TGGACTGTGG T 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(vii) IMMEDIATE SOURCE:
(B) CLONE: ZC3652

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:
ACATGCACCA TGCTCTGTGT

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(vii) IMMEDIATE SOURCE:
(B) CLONE: ZC3654

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:

AGTGATGGCA TGGACTGTGG T 
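
Each entry in the listing declares a length under item (A) that can be checked against the sequence actually printed. The sketch below does this for SEQ ID NOs 12 to 15, copying the declared lengths and sequences from the entries above. The function and variable names are illustrative assumptions, not part of the listing.

```python
# Declared lengths and sequences as printed for SEQ ID NOs 12-15.
LISTING = {
    12: (21, "TTCCATGGCA CCGTCAAGGC T"),
    13: (21, "AGTGATGGCA TGGACTGTGG T"),
    14: (20, "ACATGCACCA TGCTCTGTGT"),
    15: (21, "AGTGATGGCA TGGACTGTGG T"),
}


def strip_groups(printed: str) -> str:
    """Remove the spaces that group bases in tens and normalise case."""
    return "".join(printed.split()).upper()


def find_problems(listing):
    """Return (seq_id, message) pairs for entries that fail a basic check."""
    problems = []
    for seq_id, (declared, printed) in sorted(listing.items()):
        seq = strip_groups(printed)
        if set(seq) - set("ACGT"):
            problems.append((seq_id, "unexpected character"))
        if len(seq) != declared:
            problems.append((seq_id, f"declared {declared}, found {len(seq)}"))
    return problems


problems = find_problems(LISTING)
print(problems)
```

A check of this kind is mainly useful for spotting bases lost or added during transcription of a listing.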
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5236 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: SN23 



(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 627..3344

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:
TAAGAATTTT ATAAATACTC TGGGAATTTT ATTGGTGATG CCTTTGTGTC TACAGGGCAC 
ACGTTCCAGA GAGGTCTGGT GTGAAGTGAT GGGGGACTTG TGGCTAGAGA AGCTTTTCAA 
TGGCCTTAAA CTCTGGGTCC TGCTTGAGAG AGGTCTGAGG TTCTCAACAT CAGAGCAGAG 
CTTCCACCAA GCTTTCAGAA TGCTAAGCCC CC^CTTCTCA ACACTTAGTG CTCTGATCGG 
TGCCTGCGAA CCGAGAACGG CTGCAGTCCT CTGACCTGAG ACCAATAGCT GTGTCTACCC 

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GGACTCAGCG TCCAGCTCAC CGCCACTAAC GCGCCGCGCA TTGGACACCT GATCCACACA 
CCTTCGGGCA CCAGTGAAAA ACCGCGACTT GATTTTCTGG AAGAACGCCC CCAGGGTGTG 
GGAGOGGTCG TGGAGGACCA GCAGGAGGAA GCGGAGGGGA GAGGGGCAGT AGTGGAGGCA 
GAGAAAGCGT TGAACCAGCT GTGTTGGCCG AAGGCACGAA ACGGCAAAAG GCAGCGGTGA 540
GCATCTGTGT GGTTCCCGCT GGGAACCTGC AGGCAGGACC GGCGTGGGAA CGTGGCTGGC 
CCGCGGTGGA CCGCGTCTTC GCCACA ATG GTC CGG CTC CTC TTG ATT TTC TTC 

•'• ' .1 5 

rex ATG ATC TTT TTG GAG ATG TCC ATT TTG CCC AGG ATG CCT GAC AGA 
Met uS Phe SS Glu Met Ser lie Leu Pro Arg Met Pro Asp Arg 
10 15 20 

AAA GTA TTG CTG GCA GGT GCC TCG TCC CAG CGC TCC GTG GCG AGA ATG

Lys Val Leu Leu Ala Gly Ala Ser Ser Gln Arg Ser Val Ala Arg Met

30 35 40

__, rTe « TC vpr GGA GCC CTC TTC TCA GTC CAT CAC CAG CCT 

Sp S S vS iSSS S Sa Leu Phe Ser Val His His Gin Pro 

50 " 



45 



CCA GCC GAG AAG GTA CCC GAA AGG AAG TGT GGG GAG ATC AGG GAA CAG
Pro Ala Glu Lys Val Pro Glu Arg Lys Cys Gly Glu Ile Arg Glu Gln

60 65 70

TAT GGT ATC CAG AGG GTG GAG GCC ATG TTC CAC ACG TTG GAT AAG ATT
Tyr Gly Ile Gln Arg Val Glu Ala Met Phe His Thr Leu Asp Lys Ile
75 80 85

eee GAC CCG GTC CTC CTG CCC AAC ATC ACT CTG GGC ACT GAG ATC 
£S Sa 2£ P?o SE Leu Leu Pro Asn lie Thr Leu Gly Ser Glu lie 
90 95 iou . 

eee GXC TCC TGC TGG CAC TCT TCA GTG.GCT CTC GAA CAG AGC ATC GAA 
Sg Sp s£ Cyl ™ His Ser Ser val Ala Leu Glu Gin Ser lie Glu 

110 115 

£ £ SS s« SIS S S-S S S! S! SfSS 

125 W° " 

140 US 

& s .s s s as ss s s a c s s 

170 " 5 180 
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1229 



saa'aaaaaagaaasga 

190 " D 



a a a s a s a ss a a a a a a a a 

205 210 



- m-,- tor n>aT GTC TCA GCA GTC CAC ACA GAA 

ATA GTC AAG CGT TAC AAC TGG ACC TAT GTC TCA ^ ^ 

He val Lys Arg Tyr Asn Trp Thr Tyr vai ser ax ^ 



220 a 23 



a aa a a a a a a a a £' a a a a 
a a a a a a a a a £ as a a a 

250 255 

■"""S2SS2SS55SSS 

Gly Glu Lys Ser Phe Asp Arg Leu Leu at? 28Q 

270 275 

— « ^m/- ^mn vrc TTC TGC GAG GGC ATG ACA GTG CGG 

aaaaaSESSSSu..,-.-^". 

285 

s 6 « a a a a a aa a a a a a 

Gly Leu Leu Ser Ala Met Arg Arg 31Q 



300 305 



saaaaaaaaaaaaaaa 

315 320 

ss a a a sa s a a a 

Tyr Glu Val Glu Ala Asn Gly Gly He Tnr JJJ ** . 34S 



330 335 



as a a aa a a a'a a« - - a S s 

350 

« « « 5 & saa a 

Asn Thr Arg Asn Pro Trp. Phe Pro cut mm * ^ 

365 370 

aasaaafiaaaaagaifts. 

380 385 

asa'aaaaaaassaaaa 

™ _ Ann 



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Met- Gly Phe Val He Asn Ala lie Tyr Ala Met Ala His Gly Leu Gin 
410 415 420 425 

AAC ATG CAC CAT GCT CTG TGT CCC GGC CAT GTG GGC CTG TGT GAT GCT 1949 
Asn Met His His Ala Leu Cys Pro Gly His Val Gly Leu Cys Asp Ala 

430 435 

ATG AAA C=C ATT GAT GGC AGG AAG CTC CTG GAT TTC CTC ATC AAA TCC 1997 

Me? Lys Pro lie Asp Gly Arg Lys Leu Leu Asp Phe Leu lie Lys ser 

445 450 . 455 

TCT TTT GTC GGA GTG TCT GGA GAG GAG GTG TGG TTC GAT GAG AAG GGG 2045 

Ser Phe Val Gly Val Ser Gly Glu Glu Val Trp Phe Asp Glu Lys Gly 
460 465 470 

GAT GCT CCC GGA AGG TAT GAC ATT ATG AAT CTG CAG TAC ACA GAA GCT 2093 
Asp Ala Pro Gly Arg Tyr Asp lie Met Asn Leu Gin Tyr Thr Glu Ala 

„ A75 480 485 

AAT CGC TAT GAC TAT GTC CAC GTG GGG ACC TGG CAT GAA; GGA' GTG CTG 2141 

Asn Arg Tyr Asp Tyr Val 

490 495 



540 



GAG TTT GTG CAG GAC GAG TTC ACC TGC AGA GCC TGT GAC CTG GGG TGG 
Glu Phe Sal Gin Asp Glu Phe Thr Cys Arg Ala Cys Asp Leu Gly Trp 
555 560 565 



TGG CCC AAC GCA GAG CTC ACA GGC TGT GAG CCC ATT CCT GTC CGT TAT 
M Asn aS-GIu Leu Thr Gly Cys Glu Pro lie Pro Val Arg Tyr 
570 57 5 580 SB!> 

CTT GAG TGG ACT GAC ATA GAA TCT ATC ATA GCC ATC GCC TTT TCT TGC 
2£ SS Trp Ser Asp He Glu Ser lie lie Ala lie Ala Phe Ser Cys 

590 5 95 

CTG GGC ATC CTC GTG ACG CTG TTT GTC ACC CTC ATC TTC GTT CTG TAC 



3 G?y He Leu Va? Thr leu Phe Val Thr L. He Phe Val Leu Tyr 

CGG GAC ACA CCC GTG GTC AAA TCC TCC AGT AGG GAG CTC TGC TAT ATC 
Arg Asp Thr Pr Val Val Lys Ser Ser S r Arg Glu Leu cys Tyr iie 

620 625 
ATT CTG GCT GGT ATT TTC CTC GGC TAT GTG TGC CCT TTC ACC CTC ATC 

lie Leu Ala Gly lie Ph Leu Gly Tyr Val cys Pr Ph Thr Leu He 

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AAT CGC TAT GAC TAT GTC CAC GTG GGG ACC TOO ww. vmw* 

i£ £S-t5? val His val Gly Thr Trp His Glu Gly Val Leu 

490 j 

AAT ATT GAT GAT TAC AAA ATC CAG ATG AAC AAA. MIC GGA ATG GTA CGJ 

Asn lie Asp Asp Tyr Lys He Gin Met Asn Lys . Ser Gly Met Val Arg 

510 515 . f 2 ° 

TCT GTG TGC AGT GAG CCT TGC TTA AAG GGT CAG ATT AAG GTC ATA CGG 2237 

sS Val cys Ser Glu Pro Cys Leu Lys Gly Gin lie Lys Val He Arg 
525 530 535 

AAA GGA GAA GTG AGC TGC TGC TGG ATC TGC ACG GCC TGC AAA GAG AAT 
Lys Gly Glu Val Ser Cys Cys Trp lie Cys Thr Ala Cys Lys Glu Asn 

- - - 545 55° 



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640 * 6* 5 



•635 640 

« f i? £S S SS 55- £ a S S SS S 

Ala Lys Pro Thr Thr Thr ser cys xy* ^ c ggo ' 665 



650 655 



s aa a as a = s'a a a a a as 

670 ° 75 



sesa a s s as a a 35 a 85. 



685 690 



£ as a a 25 S as s as s a 



700 705 



ATT MT GTA CAG CI» ACA CTA GIG GTG g.JJJ JTC ATC. «G «J« 
He Ser Val Gin Leu Thr Leu Val vax rar * 
715 720 

a as s as as as fSS « = I: 



730 

- 5 a a a a a ass a a-s s a a 

1 750 755 

is aaaaa a i s,a a a.g a .a; 

765 770 

c« «c uc «= r « « $ g g s a a a a 

Pro Ala Asn PheAsn Glu Ala Lys Tyr im 79 0 



780 785 



a ssasss; a as a aa g.S a; a 5 



795 ~ 800 



a a s s a'a a a a a a a sas a 



810 815 



GCC CTG GGG TGC ATG TTT ACT CCG AAG ATG TAC ATC ATC ATT GCC AAA 
Ala Leu Gly Cys Met Phe Thr Pro Lys Met Tyr u.m ^ Q 

830 oj» 



a as a as a aa- a 

Pr Glu Arg Asn Val Arg ser ju.» j. 855 
845 850 



. rmf. rrr tgc CGC TCC AAC ACC TTC CTC 
ATG CAC CTC GGT GAT GGC AAA CTG CCG TCC CGC TCC ^ ^ ^ 

Met His Val Gly Asp Gly Lys Leu ft cys ^ 
860 M5 



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GAGGGCAGGA GGCGAGAGGG ' CAGGAGGCGG GGGTAGGTTC GGACAACAGC TCCCATCTCA 
GACCTTGACT GTGCTGAGTC TTCAGACTCC TGGACTAAGG AAGACCCGGG GACTGACCTT 
ATGAGGGTCC CTTTCCACTG CTGTGATCCA TTGCCAGCCT GTAGTCACCC GGGATAAAGG 
CACAGTAACC TTTTGCATTC CTGTGATTCC CTGTGTTTAA GGAAAAGGAA AGTATGAGCA
AAGCTATCAC CAAAAAGAGC GCCATTAGAA GTTACGGGGG AGAAAAAAAG AGAAGCAAGA
TGATATATAA GCACAGGGCC TTGAACAAGG TGAGCGTGCT TCACAGATTC CGTATTAATG 
TACAGATACT TTTGGAGAGG AGAAAGATAA CAAGGAGTGT CAGGCCGTTT GTGAACTCAC 
TTGCACTGTG CCAACCAGGT TCTCCGCTGC CCTTCAGCAA AA 
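
The amino acid line printed beneath each codon line in the coding region follows from the standard genetic code, so a legible codon line can be used to check the residue line under it. The sketch below is an illustration under that assumption. It builds the standard codon table and translates one codon row copied from the listing (the row ending at nucleotide 1949). The function names are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "Ter",
}


def translate(dna: str) -> str:
    """Translate DNA (spaces allowed) into three-letter residue codes.

    Codons containing characters other than A/C/G/T are reported as Xaa.
    A trailing partial codon is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        one_letter = CODON_TABLE.get(seq[i:i + 3])
        residues.append(THREE_LETTER.get(one_letter, "Xaa"))
    return " ".join(residues)


row = "AAC ATG CAC CAT GCT CTG TGT CCC GGC CAT GTG GGC CTG TGT GAT GCT"
print(translate(row))
```

Rows whose codons were damaged in transcription will yield Xaa at the affected positions rather than a guessed residue.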

(2) INFORMATION FOR SEQ ID NO: 17:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 906 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Met Val Arg Leu Leu Leu Ile Phe Phe Pro Met Ile Phe Leu Glu Met

1 5 10 15

Ser Ile Leu Pro Arg Met Pro Asp Arg Lys Val Leu Leu Ala Gly Ala

20 25 30

Ser Ser Gln Arg Ser Val Ala Arg Met Asp Gly Asp Val Ile Ile Gly

35 40 45

Ala Leu Phe Ser Val His His Gln Pro Pro Ala Glu Lys Val Pro Glu

50 55 60

Arg Lys Cys Gly Glu Ile Arg Glu Gln Tyr Gly Ile Gln Arg Val Glu

65 70 75 80

Ala Met Phe His Thr Leu Asp Lys Ile Asn Ala Asp Pro Val Leu Leu

85 90 95

Pro Asn Ile Thr Leu Gly Ser Glu Ile Arg Asp Ser Cys Trp His Ser

100 105 110

Ser Val Ala Leu Glu Gln Ser Ile Glu Phe Ile Arg Asp Ser Leu Ile

115 120 125

s« xu'tt. MP «!» U> "P oly L» X.n Ar, cy. Pro A.p Gly 

130 135 

cm Thr uu Pro ft. Sly *, Thr W Ly. « II iu oly «! II. 

145 150 
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Gly-Pro Gly ser Ser Ser Val Ala He Gin Val Gin Asn Leu Leu Gin 

165 170 175

Leu Phe Asp Ile Pro Gln Ile Ala Tyr Ser Ala Thr Ser Ile Asp Leu

180 185 190

Ser Asp Lys Thr Leu Tyr Lys Tyr Phe Leu Arg Val Val Pro Ser Asp

195 200 205

Thr Leu Gln Ala Arg Ala Met Leu Asp Ile Val Lys Arg Tyr Asn Trp

210 215 220

Thr Tyr Val Ser Ala Val His Thr Glu Gly Asn Tyr Gly Glu Ser Gly

225 230 235 240

Met Asp Ala Phe Lys Glu Leu Ala Ala Gln Glu Gly Leu Cys Ile Ala

245 250 255

His Ser Asp Lys Ile Tyr Ser Asn Ala Gly Glu Lys Ser Phe Asp Arg

260 265 270

Leu Leu Arg Lys Leu Arg Glu Arg Leu Pro Lys Ala Arg Val Val Val

275 280 285

Cys Phe Cys Glu Gly Met Thr Val Arg Gly Leu Leu Ser Ala Met Arg

290 295 300

Arg Leu Gly Val Val Gly Glu Phe Ser Leu Ile Gly Ser Asp Gly Trp

305 310 315 320

Ala Asp Arg Asp Glu Val Ile Glu Gly Tyr Glu Val Glu Ala Asn Gly

325 330 335

Gly Ile Thr Ile Lys Leu Gln Ser Pro Glu Val Arg Ser Phe Asp Asp

340 345 350

Tyr Phe Leu Lys Leu Arg Leu Asp Thr Asn Thr Arg Asn Pro Trp Phe

355 360 365

Pro Glu Phe Trp Gln His Arg Phe Gln Cys Arg Leu Pro Gly His Leu

370 375 380

Leu Glu Asn Pro Asn Phe Lys Lys Val Cys Thr Gly Asn Glu Ser Leu

385 390 395 400

Glu Glu Asn Tyr Val Gln Asp Ser Lys Met Gly Phe Val Ile Asn Ala

405 410 415

Ile Tyr Ala Met Ala His Gly Leu Gln Asn Met His His Ala Leu Cys

420 425 430

Pro Gly His Val Gly Leu Cys Asp Ala Met Lys Pro Ile Asp Gly Arg

435 440 445

Lys Leu Leu Asp Phe Leu Ile Lys Ser Ser Phe Val Gly Val Ser Gly

450 455 460
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!u Glu val Trp Phe Asp. Glu Lys Gly Asp Ala Pro Gly Arg Tyr Asp 



465 470 475 480

Ile Met Asn Leu Gln Tyr Thr Glu Ala Asn Arg Tyr Asp Tyr Val His

485 490 495

Val Gly Thr Trp His Glu Gly Val Leu Asn Ile Asp Asp Tyr Lys Ile

500 505 510

Gln Met Asn Lys Ser Gly Met Val Arg Ser Val Cys Ser Glu Pro Cys

515 520 525

Leu Lys Gly Gln Ile Lys Val Ile Arg Lys Gly Glu Val Ser Cys Cys

530 535 540

Trp Ile Cys Thr Ala Cys Lys Glu Asn Glu Phe Val Gln Asp Glu Phe

545 550 555 560

Thr Cys Arg Ala Cys Asp Leu Gly Trp Trp Pro Asn Ala Glu Leu Thr

565 570 575

Gly Cys Glu Pro Ile Pro Val Arg Tyr Leu Glu Trp Ser Asp Ile Glu

580 585 590

Ser Ile Ile Ala Ile Ala Phe Ser Cys Leu Gly Ile Leu Val Thr Leu

595 600 605

Phe Val Thr Leu Ile Phe Val Leu Tyr Arg Asp Thr Pro Val Val Lys

610 615 620

Ser Ser Ser Arg Glu Leu Cys Tyr Ile Ile Leu Ala Gly Ile Phe Leu

625 630 635 640

Gly Tyr Val Cys Pro Phe Thr Leu Ile Ala Lys Pro Thr Thr Thr Ser

645 650 655

Cys Tyr Leu Gln Arg Leu Leu Val Gly Leu Ser Ser Ala Met Cys Tyr

660 665 670

Ser Ala Leu Val Thr Lys Thr Asn Arg Ile Ala Arg Ile Leu Ala Gly

675 680 685

Ser Lys Lys Lys Ile Cys Thr Arg Lys Pro Arg Phe Met Ser Ala Trp

690 695 700

Ala Gln Val Ile Ile Ala Ser Ile Leu Ile Ser Val Gln Leu Thr Leu

705 710 715 720

Val Val Thr Leu Ile Ile Met Glu Pro Pro Met Pro Ile Leu Ser Tyr

725 730 735

Pro Ser Ile Lys Glu Val Tyr Leu Ile Cys Asn Thr Ser Asn Leu Gly

740 745 750

Val Val Ala Pro Val Gly Tyr Asn Gly Leu Leu Ile Met Ser Cys Thr

755 760 765
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Tyr Tyr Ala Phe Lys thr Arg Asn Val Pro Ala Asn Phe Asn Glu Ala 

770 775 780 

Lys Tyr Ile Ala Phe Thr Met Tyr Thr Thr Cys Ile Ile Trp Leu Ala

785 790 795 800

Phe Val Pro Ile Tyr Phe Gly Ser Asn Tyr Lys Ile Ile Thr Thr Cys

805 810 815

Phe Ala Val Ser Leu Ser Val Thr Val Ala Leu Gly Cys Met Phe Thr

820 825 830

Pro Lys Met Tyr Ile Ile Ile Ala Lys Pro Glu Arg Asn Val Arg Ser

835 840 845

Ala Phe Thr Thr Ser Asp Val Val Arg Met His Val Gly Asp Gly Lys

850 855 860

Leu Pro Cys Arg Ser Asn Thr Phe Leu Asn Ile Phe Arg Arg Lys Lys

865 870 875 880

Pro Gly Ala Gly Asn Ala Lys Lys Arg Gln Pro Glu Phe Ser Pro Ser

885 890 895

Ser Gln Cys Pro Ser Ala His Ala Gln Leu

900 905
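
The CDS feature recorded for SEQ ID NO: 16 (627..3344) can be cross-checked against the 906-residue length stated for SEQ ID NO: 17. The sketch below assumes that the location uses 1-based, inclusive coordinates and that the span counts sense codons only. That assumption is consistent with the stated lengths but is not spelled out in the listing, and the function name is hypothetical.

```python
def cds_codons(location: str) -> int:
    """Return the number of codons spanned by a 'start..end' CDS location.

    Coordinates are treated as 1-based and inclusive. Stray spaces, as
    sometimes introduced in transcription, are ignored.
    """
    start_text, end_text = location.replace(" ", "").split("..")
    start, end = int(start_text), int(end_text)
    length = end - start + 1
    if length <= 0 or length % 3 != 0:
        raise ValueError(f"CDS span {location!r} is not a whole number of codons")
    return length // 3


# 627..3344 spans 2718 nucleotides, i.e. 906 codons.
print(cds_codons("627..3344"))
```

The span of 2718 nucleotides corresponds to 906 codons, which agrees with the declared length of SEQ ID NO: 17.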

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4095 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: SN30 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 463..3198

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
CCCGGGCTCC CGGCAGTGCG AGCAGCTAAG GGCTGGCCGC CGCCTCCCTG AGCTCCCCCG 
GAGCAGCCGA CCCCTGGTCG CGGCGTTCAC CTCGCCGATG CGCGGTTGGT AGGAGTGACC 
GGAGCCATTC TCTCCTCGTT GATAAGATTC CCTACCAGGA TAGGAGCCTA TCTCCCTTTC 
ACAGCAGGAC ACAGAAATCT GGCCTTCAGT ACTTTGGGAA AAGGATCTGA GACCTCCTGG 
AGCTCTGACC ACTGGCTGTC ATCTGTGGCT CTGGCCTGTG TGGGCCACTG AGCTCTACTC 
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AAKATTAAA GAGGAGGAGG GGAGATCTGT GGAATGGGCC ACCCCGTTGG CCTGCTGCAT 
TACTGAACCI GCGCTGICCA CACGTGCCCA GATCATGGGA CCCAGGGCCT GCTAGGGCTA 
GGAGCGGGGC CCAGTATTCA TCGGTCTCTA GGCCTTTCCG AA ATG TCC GGG AAG 

GGA GGC TGG GCC TGG TGG TGG GCC CGG CTG CCC CTC TGC CTA CTC CTC 

Gly Gly Trp Ala Trp Trp Trp Ala «g uu fw w y 2Q 

5 10 

m~. rvr CCT TCA TCC TTG GGA AAG CCC AAG GGT 

S % «u «■«»• » «?s »» 

25 30 
CAC CCC CAC ATG AAC TCI ATC CGA ATT GAC « GAC ATC.ACA CTG «A 
His • ro His Met. Asn Ser He Arg lie Asp eiy p sq 

V 40 45 



ss SS S SfS CSS SSS.Sj ss ss 



55 60 



m /v-e ATC CAC CGC CTG GAG GCC . ATG .CTG TTT 
GAG CTG AAG AAG GAG AAA GGC ATC CAC cju ^ phe 

Glu Leu Lys Lys Glu Lys Gly He His Arg **» 



70 75 



GCC CTG GAC CGC ATC AAC AAT GAC CCG GAC CTA CTG CCC AAC ATC ACG 
Ala Leu Asp Arg He Asn Asn Asp Pro Asp teu lQQ 
85 90 

= s 5 arss ss 5 as s ss a .s 

105 

GAG CAG TCA CTG ACC TTT GTG CGG GCG CTC ATC AAG GAC GGC £ 
Glu Gin Ser Leu Thr Phe val Arg i*> ^ Q 



120 " 5 



GAG GTC CGC TGGvGGC AGG CGG GGC CCG CCC ATC ATC ACC AAG CCC GAA 

Glu val Arg Cys Gly Arg Arg Gly Pro pro j. ^ 



135 «° 



S S s s s s as s .s s i s s s s 



150 155 



s si s s sssfts f ss s T £ s 

165 170 

S^SFSSSSSSSSSSSi 

185 

_ ^ „ e »ca TAC CAG GCC CAG GCC ATG GTG GAT ATT 
CGG GTG GTG CCC TCA GAC ACA TAW ^ uw 
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An Val Val /Pro Ser Asp Thr Tyr Gin Ala 

200 205 

■ - t . . . 

GTC CGA GCC iCTC AAG TGG AAC TAT GTG TCC 
Val Arg Ala Leu Lys Trp Asn Tyr Val Ser 
215 220 

AGC TAC GCT GAG AGT GGT GTG GAG GCC TTT 
Ser Tyr Gly Glu Ser Gly Val Glu Ala Phe 
230 235 



Gin Ala Met Val Asp He 
210 

ACA CTG GCC TCA GAG GGC 
Thr Leu Ala Ser Glu Gly 
225 

ATC CAG AAG TCC CGA GAG. 
He Gin Lys Ser Arg Glu 
240 



ur /wp GTG T gc ATT GCC CAG TCG GTG AAG ATT CCA. CGG GAA CCC 

£S S- Sy 5S C?s S Ala Gin Ser val Lys lie Pro Arg Glu Pro 
245 250 

AAG ACG GGG GAG TTC GAC AAG ATC ATC AAA CGC CTA CTG GAA ACA TCC 
™ fS £? Glu ge Asp Lys lie lie Lys Arg Leu Leu Glu Thr Ser 

•v 

AAT GCC AGG GGT ATC ATC ATC TTT GCC AAC GAG GAT GAC ATC AGG AGG 
,£n Sa Gly lie lie lie Phe Ala Asn Glu Asp Asp lie Arg Arg 

280 u 285 

GTG TTG GAG GCA GCT CGC AGG GCC AAC CAG ACC GGC CAC TTC TTT TGG 
?S Su aS Ala Arg Arg Ala Asn Gin Thr Gly His Phe Phe Trp 
295 300 303 



ATG GGT TCT GAT AGC TGG GGC TCC AAG AGT GCC 
Met Gly Ser Asp Ser Trp Gly Ser Lys Ser Ala 

310 \ 3" 

GAG GAG GTG GCC GAG GGC GCA GTC ACC ATT CTC 
Glu Glu Val Ala Glu Gly Ala Val Thr He Leu 

325 '. 330 335 

GTT CGA GGG TTC GAC CGA TAC TTC TCC AGC CGC 
Val Arg Gly Phe Asp Arg Tyr Phe Ser Ser Arg. 

»«- nrn ue ATC TGG TTT GCC GAG TTC TGG GAG GAC AAC TTC CAT TGC 
£ Sg S.S ™ A?a Glu Phe Trp Glu Asp Asn Phe His Cys 
360 365 

CAC GCG CTC AAG AAG GGA AGC CAC ATC AAG AAG TGC 
His Ala Leu Lys Lys Gly Ser His lie Lys Lys Cys 
380 385 



CCT GTG CTG CGC CTT 
Pro Val Leu Arg Leu 
320 

CCC AAG AGG ATG TCT 
Pro Lys Arg Met Ser 

340 

ACG CTG GAC AAC AAC 
Thr Leu Asp Asn Asn 
355 



AAG TTG 
Lys Leu 



ACC AAC 
Thr Asn 
390 

AAG GTG 
Lys Val 

405 

CAC GCC 
His Ala 



AGC CGC 
Ser Arg 

375 ; 

CGA GAG 
Arg Glu 



CGC ATC GGG CAG GAC TCG GCC TAT GAG CAG GAG GGG 
Sg lie Gly Gin Asp Ser Ala Tyr Glu Gin Glu Gly 
395 400 

CAG irC GTG ATT GAC GCT GTG TAC GCC ATG GGC CAC GCG CTG 
Gin Phe Val lie Asp Ala Val Tyr Ala Met Gly His Ala jeu 

410 415 -f. 

ATG CAC CGT GAC CTG TGT CCC GGC CGC GTA GGA CTC TGC CCT 
Met His Arg Asp Leu Cys Pro Gly Arg vaj. uxy i 



G PROTEIN-COUPLED GLUTAMATE RECEPTORS



Background of the Invention 
The majority of nerve cell connections are 
chemical synapses. A neurotransmitter is released from 
the presynaptic terminal, typically in response to the 
arrival of an action potential in the neuron, and
diffuses across the synaptic space to bind to membrane 
receptor proteins of the postsynaptic terminal. The 
binding of neurotransmitters to membrane receptors is 
coupled either to the generation of a permeability change 
in the postsynaptic cell or to metabolic changes. 

Neurotransmitters produce different effects 
according to the type of receptor to which they bind. In 
general, those which produce effects that are rapid in
onset and brief in duration bind to receptors that act as 
ligand-gated ion channels, where binding almost instantly 
causes an ion flow across the membrane of the 
postsynaptic cell. Those neurotransmitters which act 
more like local chemical mediators bind to receptors that 
are coupled to intracellular enzymes, thereby producing 
effects that are slower in onset and more prolonged. 
These neurotransmitters alter the concentration of 
intracellular second messengers in the postsynaptic cell. 

Four second messenger systems have been linked
to neurotransmitter or hormone receptors and have been
studied for their roles in the control of neuronal
excitability. They are the adenylate cyclase/cyclic AMP-
dependent protein kinase system, guanylate cyclase and
cGMP-dependent protein kinase, the inositol
trisphosphate/diacyl glycerol-protein kinase C system,
and systems which are activated by calcium ions, such as
the calcium/calmodulin-dependent protein kinase system.

Thus, binding of a transmitter to a receptor may
activate, for example, adenylate cyclase, thereby
increasing the intracellular concentration of cAMP. The
cAMP activates protein kinases that phosphorylate
proteins in the cells, which form ion channels, thereby
altering the cells' electrical behavior. As with the
ligand-gated ion channel transmitters, the effects can be
either excitatory or inhibitory, and may affect the cell
at many levels, including the pattern of gene expression.
It is also believed that these chemical synapses,
associated with second-messenger systems, may be involved
in long-term changes that comprise the cellular basis of
learning and memory.

The ligand-activated membrane receptors do not
activate the second messenger systems directly, however,
but via a membrane-bound protein, the GTP-binding protein
(G protein), which binds GTP on the cytoplasmic surface
of the cell membrane and thereby acts to couple adenylate
cyclase to the membrane receptor. Neurotransmitter
binding to the membrane receptor is believed to alter the
conformation of the receptor protein to enable it to
activate the G protein in the lipid bilayer, which then
binds GTP at the cytoplasmic surface and produces a
further change in the G protein to allow it to activate,
e.g., an adenylate cyclase molecule to synthesize cAMP.
When a ligand binds a receptor, an enzymatic cascade
results as each receptor activates several molecules of G
protein, which in turn activate more molecules of
adenylate cyclase which convert an even larger number of
ATPs to cAMP molecules, producing a substantial
amplification from the initial event.
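
The amplification described above is multiplicative: the overall gain is the product of the gains at each stage of the cascade. The sketch below illustrates only that arithmetic. The per-stage gains are hypothetical placeholders chosen for readability, not measured values and not figures taken from this document.

```python
from math import prod
from typing import Sequence


def cascade_output(initial_events: int, stage_gains: Sequence[int]) -> int:
    """Return the number of end products after a multi-stage cascade.

    Each stage multiplies the number of active molecules by its gain.
    """
    return initial_events * prod(stage_gains)


# HYPOTHETICAL gains, for illustration only:
#   one occupied receptor -> 10 activated G proteins
#   one G protein         -> 2 activated adenylate cyclase molecules
#   one cyclase molecule  -> 100 cAMP molecules synthesized
HYPOTHETICAL_GAINS = (10, 2, 100)

camp_per_binding_event = cascade_output(1, HYPOTHETICAL_GAINS)
print(camp_per_binding_event)
```

With these placeholder gains a single binding event would yield 2000 cAMP molecules. Modest gains at each step compound into a large overall amplification.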

Glutamate, aspartate and their endogenous
derivatives are believed to be the predominant excitatory
neurotransmitters in the vertebrate central nervous
system. (Krnjevic, Physiol. Rev. 54:418-540, 1974).
Recently, glutamate has been described as playing a
major, widespread role in the control of neuroendocrine
neurons, possibly controlling not only the neuroendocrine
system but other hypothalamic regions as well. Four
major subclasses of glutamate receptors have been
described but their characterization has until recently
been limited to pharmacological and electrophysiological
functional analyses. See generally, Hollman et al.,
Nature 342:643-648 (1989) and Sommer et al., Science
249:1580-1585 (1990). Three of the receptors, the
quisqualate (QA/AMPA), kainate (KA), and N-methyl-D-
aspartate (NMDA) receptors, are believed to be directly
coupled to cation-specific ion channels and thus are
classified as ligand-gated ionotropic receptors. The
fourth glutamate receptor binds some of the agonists of
the ionotropic receptors (quisqualate and glutamate, but
not AMPA) but has no shared antagonists, and is coupled
to G protein. Thus, this receptor, referred to as the G
protein-coupled glutamate receptor, or GluGR, is
pharmacologically and functionally distinct from the
other major glutamate receptors. This receptor has also
been termed the metabotropic receptor.

Agonist binding to GluGR has been shown to
result in the activation of a number of second messenger
systems, depending on the system studied. One of the
best characterized is the quisqualate activation of
phospholipase C through a G protein-coupled interaction
that leads to the stimulation of inositol phospholipid
metabolism. This activity has been studied in systems
that measure the accumulation of radiolabeled inositol
monophosphate in response to stimulation by glutamate.
The systems typically use brain slices from regions such
as the hippocampus, striatum, cerebral cortex and
hypothalamus (Nicoletti et al., Proc. Natl. Acad. Sci.
USA 83:1931-1935 (1986), and Nicoletti et al.,
Neurochem. 46:40-46 (1986)), neuronal cultures derived
from embryonic mouse and rat cerebellum, corpus striatum
and cerebral cortex (Nicoletti et al., J. Neurosci.
6:1905-1911 (1986), Sladeczek et al., Nature 317:717-719
(1985), Dumuis et al., Nature 347:182-184 (1990), and
Drejer et al., J. Neurosci. 7:2910-2916 (1987)) and rat
brain synaptosomes (Recasens et al., Eur. J. Pharm.
141:87-93 (1987), and Recasens et al., Neurochem. Int.
13:463-467 (1988)). A major disadvantage of each of
these model systems is the difficulty in analyzing the
pharmacological and functional activities of GluGR in an
environment where other glutamate receptors and G
protein-coupled receptors such as muscarinic and
serotonin receptors are also present.

The Xenopus oocyte system has been used to
identify GluGR as a member of the family of G protein-
coupled receptors. An endogenous inositol triphosphate
second messenger-mediated pathway in the oocyte allows
the detection of GluGR after injection of total rat brain
mRNA, in that the oocyte responds to ligand via the
oocyte G protein-coupled PLC-mediated activation of a
chloride channel that can be detected as a delayed,
oscillatory current by voltage-clamp recording (Houamed
et al., Nature 310:318-321 (1984), Gundersen et al.,
Proc. Royal Soc. B221:127-143 (1984), Dascal et al., Mol.
Brain Res. 1:301-309 (1986), Verdoorn et al., Science
238:1114-1116 (1987), Sugiyama et al., Nature 325:531-533
(1987), Hirono et al., Neurosci. Res. 6:106-114 (1988),
Verdoorn and Dingledine, Mol. Pharmacol. 34:298-307
(1988), and Sugiyama et al., Neuron 3:129-132 (1989)).
Injection of region-specific brain mRNA and of size-
fractionated mRNA has suggested that GluGR may be encoded
by a large mRNA (6-7 kb) that is enriched in the cerebellum
(Fong et al., Synapse 2:657-665 (1988) and Horikoshi et
al., Neurosci. Lett. 105:340-343 (1989)).

There remains considerable need in the art for 
isolated and purified GluGR, as well as systems capable of 
expressing GluGR separate from other neurotransmitter 
receptors. Further, it would be desirable to 
specifically identify the presence of GluGR in cells and 
tissues, thereby avoiding the time-consuming, complex and 
nonspecific functional electrophysiological and 
pharmacological assays. It would also be desirable to 
screen and develop new agonists and/or antagonists 
specific for GluGR, but to date this has not been 
practical. Quite surprisingly, the present invention 
fulfills these and other related needs. 

Summary of the Invention 

The present invention provides isolated and 
substantially pure preparations of mammalian G protein- 
coupled glutamate receptors and fragments thereof. In 
preferred embodiments the receptors are coupled to a G 
protein in vertebrate cells, bind glutamate and 
quisqualate and thereby activate phospholipase C, and are 
capable of stimulating inositol phospholipid metabolism. 
Having provided such receptors in isolated and purified 
form, the invention also provides antibodies to the 
receptors, in the form of antisera and/or monoclonal 
antibodies. 

In another aspect the invention provides the 
ability to produce the mammalian G protein-coupled 
glutamate receptors and polypeptides or fragments thereof 
by recombinant means, preferably in cultured eukaryotic 
cells. The expressed receptors or fragments may or may 
not have the biological activity of corresponding native 
receptors, and may or may not be coupled to a G protein 
in the cell used for expression. Accordingly, isolated 
and purified polynucleotides are described which code for 
the receptors and fragments thereof, where the 
polynucleotides may be in the form of DNA, such as cDNA, 
or RNA. Based on these sequences probes may be used to 
hybridize and identify these and related genes which 
encode mammalian G protein-coupled glutamate receptors. 
The probes may be full length cDNA or as small as from 14 
to 25 nucleotides, more often though from about 40 to 
about 50 or more nucleotides. 

In related embodiments the invention concerns 
DNA constructs which comprise a transcriptional promoter, 
a DNA sequence which encodes the receptor or fragment, 
and a transcriptional terminator, each operably linked 
for expression of the receptor. For expression the 
construct may also contain at least one signal sequence. 
The constructs are preferably used to transform or 
transfect eukaryotic cells, more preferably mammalian 
cells which do not express endogenous G protein-coupled 
glutamate receptors. When bound by an appropriate ligand 
such as glutamate or quisqualate, the receptor may 
activate phospholipase C in the host cell via coupling to 
G protein. Further, for large scale production the 
expressed receptor may also be isolated from the cells 
by, for example, immunoaffinity purification. 

Cells which express the G protein-coupled 
glutamate receptors may also be used to identify 
compounds which can alter the receptor-mediated 
metabolism of a eukaryotic cell. Compounds may be 
screened for binding to the receptor, and/or for 
effecting a change in receptor-mediated metabolism in the 
host cell. Agonists and/or antagonists of the G protein- 
coupled glutamate receptors may also be screened in cell- 
free systems using purified receptors or binding 
fragments thereof for the effect on ligand-receptor 
interaction, or using reconstituted systems such as 
micelles which also provide the ability to assess 
metabolic changes. 

In yet other embodiments the invention relates 
to methods for diagnosis, where the presence of a 
mammalian G protein-coupled glutamate receptor in a 
biological sample may be determined. For example, a 
monospecific antibody which specifically binds a G 
protein-coupled glutamate receptor is incubated with the 
sample under conditions conducive to immune complex 
formation, which complexes are then detected, typically 
by means of a label such as an enzyme, fluorophore, 
radionuclide, chemiluminescer, particle, or a second 
labeled antibody. Thus, means are provided for 
immunohistochemical staining of tissues, including brain 
tissues, for the subject receptors. 

Brief Description of the Figures 

Figure 1 illustrates the construction of 
plasmid pVEGT, where Fig. 1A shows the construction of 
pVEG, Fig. 1B shows the construction of pVEG' and Fig. 1C 
shows pVEGT'. Symbols used are T7 pro, the T7 promoter; 
T1 and T2, synthetic and native T7 terminators, 
respectively; M13, M13 intergenic region; the parentheses 
indicate a restriction site destroyed in vector 
construction; and pA is the Aspergillus niger 
polyadenylate sequence. 

Figure 2 illustrates representative responses 
from voltage-clamp assays of oocytes injected with RNA 
from positive pools. 

Figure 3 illustrates a partial restriction map 
of clone 45-A. 

Figure 4 illustrates the cloning of the 
receptor cDNA present in clone 45-A into Zem228R. 

Figure 5 illustrates the DNA sequence and 
deduced amino acid sequence of clone 45-A (corresponding 
to Sequence ID Nos. 1 and 2). Numbers below the line 
refer to amino acid sequence, numbers above the line 
refer to nucleotide number. Putative transmembrane 
domains have been overlined, and putative N-linked 
glycosylation sites are indicated by closed circles. 

Figure 6 illustrates a representative dose 
response curve for varying concentrations of L-glutamic 
acid. Error bars, where larger than the symbols, 
represent SEM. 

Figure 7 illustrates the DNA sequence and 
deduced amino acid sequence of a subtype 1b glutamate 
receptor clone (Sequence ID Nos. 16 and 17). Numbers 
below the line refer to amino acid sequence. Numbers 
above the line refer to nucleotide sequence. 

Figure 8 illustrates the DNA sequence and 
deduced amino acid sequence of a subtype 2a glutamate 
receptor clone (Sequence ID Nos. 18 and 19). Numbers 
below the line refer to amino acid sequence. Numbers 
above the line refer to nucleotide sequence. 

Figure 9 illustrates the DNA sequence of a 
partial subtype 2b glutamate receptor clone (Sequence ID 
No. 20). Numbers refer to the nucleotide sequence. 



Description of the Specific Embodiments 

GluGR is a family of G protein-coupled membrane 
receptors for the neurotransmitter glutamate. As 
glutamate has been described as having a major role in 
the control of neurons, particularly neuroendocrine 
neurons, GluGR may play a critical role in effectuating 
such control. Consequently, the development of agonists 
and antagonists of the GluGR-ligand interaction and 
GluGR-mediated metabolism is of great interest. 

The present invention presents the means to 
identify agonists and antagonists of the GluGR-ligand 
interaction by providing isolated GluGR. The term 
"GluGR" refers to any protein either derived from a 
naturally occurring GluGR, or which shares significant 
structural and functional characteristics peculiar to a 
naturally occurring GluGR. Such a receptor may result 
when regions of a naturally occurring receptor are 
deleted or replaced in such a manner as to yield a 
protein having a similar function. Homologous sequences, 
allelic variations, and natural mutants; induced point, 
deletion, and insertion mutants; alternatively expressed 
variants; proteins encoded by DNA which hybridizes under 
high or low stringency conditions to nucleic acids which 
encode naturally occurring GluGR; 
proteins retrieved from naturally occurring materials; 
and closely related proteins retrieved by antisera 
directed against GluGR proteins are also included. 
analog, or chimeric GluGR as generally described in U.S. 
Pat. No. 4,859,609, incorporated by reference herein. 
The molecule may be chemically synthesized or may occur 
in nature. Ligands may be grouped into agonists and 
antagonists. Agonists are those molecules whose binding 
to a receptor induces the response pathway within a cell. 
Antagonists are those molecules whose binding to a 
receptor blocks the response pathway within a cell. 

By "isolated" GluGR is meant a GluGR 
which is in other than its native environment such as a 
neuron, including, for example, substantially pure GluGR 
as defined hereinbelow. More generally, isolated is 
meant to include a GluGR as a heterologous component of a 
cell or other system. For example, a GluGR may be 
expressed by a cell transfected with a DNA construct 
which encodes the GluGR, separated from the cell and added 
to micelles which contain other selected receptors. In 
another example described below, a GluGR is expressed by a 
cell which has been co-transfected with a gene encoding 
muscarinic receptor. Thus, in this context, the 
environment of the isolated GluGR is not as it occurs in 
its native state, particularly when it is present in a 
system as an exogenous component. 

The invention provides cloned GluGR coding 
sequences which are capable of expressing GluGR proteins. 
Complementary DNA encoding GluGR may be obtained by 
constructing a cDNA library from mRNA from, for example, 
brain tissue. The library may be screened by 
transcribing the library and injecting the resulting mRNA 
into oocytes and detecting, by functional assays, those 
oocytes which express the GluGR. Alternatively, the 
clones may be screened with a complementary labeled 
oligonucleotide probe. 

The present invention relates to successfully 
isolating a cDNA encoding a GluGR. Functional cloning of 
GluGR was accomplished by substantial modifications and 
improvements to a number of cDNA cloning and molecular 
biology techniques. Initially, an enriched source of 
GluGR mRNA prepared by sucrose gradient centrifugation of 
>4 kb length rat cerebellum poly(A)+ mRNA was used as 
template for cDNA synthesis. Further, a cDNA cloning 
vector that was employed included a poly(A) tail, thereby 
increasing by 40-fold the translational efficiency of the 
transcription product of the cDNA insert, and a polylinker 
site to allow the directional cloning of the cDNA into 
the vector between the promoter and the poly(A) tail. 
Vector construction for directional cloning is described 
in co-pending U.S. S.N. 07/320,191, incorporated herein by 
reference. The cDNA cloning vector also was used with 
two transcriptional terminators, in tandem, following the 
poly(A) sequences, efficiently generating a unit length 
transcript product without non-coding plasmid or viral 
sequences, and without requiring a restriction 
endonuclease to linearize the DNA template (a standard 
practice that will often prevent functional cloning 
strategies from working due to the presence of the 
endonuclease site within the coding region of the cDNA). 
The cDNA synthesis strategy maximized insert size and 
recreation of the 5' ends of the cDNAs, without 
introduction of homopolymer tails. cDNA inserts were 
size-selected to be greater than 4 kb in length before 
insertion into the vector. A library of 10^6 cDNA inserts 
in pools of 100,000 was replica plated to reduce the 
number of amplification steps in the fractionation of 
sequentially smaller pools. Moreover, m1 muscarinic cDNA 
(another G protein-coupled receptor coupled to 
phosphoinositol metabolism) template was included in 
transcription reactions of the subfractionated pools so 
that before injection the in vitro transcripts from each 
pool could be assayed by Northern analysis to assess 
relative quantity and quality of the mRNA, and by 
voltage-clamp of oocytes as an internal positive control 
for each oocyte not responding to quisqualate or 
glutamate. The inclusion of a dilution of SEAP-VEGT' (a 
secreted form of alkaline phosphatase) template in 
transcriptions was also employed so that oocytes selected 
for voltage-clamp analysis were those synthesizing higher 
levels of the co-injected GluGR mRNA. And further, low 
noise electrical recording techniques were used to 
monitor the small signals initially generated from rare 
transcripts. 
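
The pool fractionation described above can be illustrated with a short arithmetic sketch. The text gives only the library size (10^6 inserts) and the starting pool size (100,000); the ten-fold split per round and the function names used here are assumptions made for illustration, not details taken from the disclosure.

```python
def initial_pool_count(library_size: int, pool_size: int) -> int:
    """Number of starting pools needed to cover the whole library."""
    return -(-library_size // pool_size)  # ceiling division


def rounds_to_single_clone(pool_size: int, split_factor: int) -> int:
    """Rounds of subdivision until a positive pool holds a single clone.

    split_factor is an assumed value; the source does not state it.
    """
    if pool_size < 1 or split_factor < 2:
        raise ValueError("pool_size must be >= 1 and split_factor >= 2")
    rounds = 0
    while pool_size > 1:
        # Each round divides the positive pool into smaller sub-pools.
        pool_size = -(-pool_size // split_factor)
        rounds += 1
    return rounds


if __name__ == "__main__":
    print(initial_pool_count(1_000_000, 100_000))
    print(rounds_to_single_clone(100_000, 10))
```

Under the assumed ten-fold split, a positive pool of 100,000 clones is reduced to a single clone in five rounds, which shows why limiting the number of amplification steps between rounds matters.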

The above-described methods were used to 
isolate a cDNA clone encoding a GluGR designated "subtype 
1a." Oligonucleotide probes based on the sequence of the 
subtype 1a clone were used to probe additional brain and 
cerebellum cDNA libraries. These libraries yielded 
clones encoding additional subtypes, which were 
designated 1b, 2a and 2b. 

With the GluGR and cDNA clones thereof provided 
herein, nucleotide and amino acid sequences may be 
determined by conventional means, such as by dideoxy 
sequencing. See generally, Sambrook et al., Molecular 
Cloning, A Laboratory Manual, 2d ed., Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, NY, 1989, 
incorporated by reference herein. Genomic or cDNA 
sequences encoding GluGR and homologous receptors of this 
family may be obtained from libraries prepared from other 
mammalian species according to well known procedures. 
For instance, using oligonucleotide probes from rodent 
GluGR, such as whole length cDNA or shorter probes of at 
least about fourteen nucleotides to twenty-five or more 
nucleotides in length, often as many as 40 to 50 
nucleotides, DNA sequences encoding GluGR of other 
mammalian species, such as lagomorph, avian, bovine, 
porcine, murine, etc. may be obtained. If partial clones 
are obtained, it is necessary to join them in proper 
reading frame to produce a full length clone, using such 
techniques as endonuclease cleavage, ligation and loopout 
mutagenesis. 

A DNA sequence encoding GluGR is inserted into a 
suitable expression vector, which in turn is used to 
transfect eukaryotic cells. Expression vectors for use 
in carrying out the present invention will comprise a 
promoter capable of directing the transcription of a 
cloned DNA and a transcriptional terminator. 

To direct proteins of the present invention for 
transport to the plasma membrane, at least one signal 
sequence is operably linked to the DNA sequence of 
interest. The signal sequence may be derived from the 
GluGR coding sequence, from other signal sequences 
described in the art, or synthesized de novo. 

Host cells for use in practicing the present 
invention include mammalian, avian, plant, insect and 
fungal cells, but preferably mammalian cells. Fungal 
cells, including species of yeast (e.g., Saccharomyces 
spp., particularly S. cerevisiae, Schizosaccharomyces 
spp.) or filamentous fungi (e.g., Aspergillus spp., 
Neurospora spp.) may be used as host cells within the 
present invention. Suitable yeast vectors for use in the 
present invention include YRp7 (Struhl et al., Proc. 
Natl. Acad. Sci. USA 76:1035-1039, 1978), YEp13 (Broach 
et al., Gene 8:121-133, 1979), POT vectors (Kawasaki et 
al., U.S. Patent No. 4,931,373, which is incorporated by 
reference herein), pJDB249 and pJDB219 (Beggs, Nature 
275:104-108, 1978) and derivatives thereof. Such vectors 
will generally include a selectable marker, which may be 
one of any number of genes that exhibit a dominant 
phenotype for which a phenotypic assay exists to enable 
transformants to be selected. Preferred selectable 
markers are those that complement host cell auxotrophy, 
provide antibiotic resistance or enable a cell to utilize 
specific carbon sources, and include LEU2 (Broach et al., 
ibid.), URA3 (Botstein et al., Gene 8:17, 1979), HIS3 
(Struhl et al., ibid.) or POT1 (Kawasaki et al., ibid.). 
Another suitable selectable marker is the CAT gene, which 
confers chloramphenicol resistance on yeast cells. 

Additional vectors, promoters and terminators 
for use in expressing the receptor of the invention in 
yeast are well known in the art and are reviewed by, for 
example, Emr, Meth. Enzymol. 185:231-279 (1990), 
incorporated herein by reference. The receptors of the 
invention may be expressed in Aspergillus spp. (McKnight 
and Upshall, described in U.S. Patent 4,935,349, which is 
incorporated herein by reference). Useful promoters 
include those derived from Aspergillus nidulans 
glycolytic genes, such as the ADH3 promoter (McKnight et 
al., EMBO J. 4:2093-2099, 1985) and the tpiA promoter. 
An example of a suitable terminator is the ADH3 
terminator (McKnight et al., ibid.). Techniques for 
transforming fungi are well known in the literature, and 
have been described, for instance by Beggs (ibid.), 
Hinnen et al. (Proc. Natl. Acad. Sci. USA 75:1929-1933, 
1978), Yelton et al. (Proc. Natl. Acad. Sci. USA 81:1740- 
1747, 1984), and Russell (Nature 301:167-169, 1983) each 
of which is incorporated herein by reference. 

A variety of higher eukaryotic cells may serve 
as host cells for expression of the GluGR, although not 
all cell lines will be capable of functional coupling of 
the receptor to the cell's second messenger systems. 
Cultured mammalian cells, such as BHK, CHO, Y1 (Shapiro 
et al., TIPS Suppl. 43-46 (1989)), NG108-15 (Dawson et 
al., Neuroscience Approached Through Cell Culture, vol. 
2, pages 89-114 (1989)), N1E-115 (Liles et al., J. Biol. 
Chem. 261:5307-5313 (1986)), PC 12 and COS-1 (ATCC CRL 
1650) are preferred. Preferred BHK cell lines are the 
tk- ts13 BHK cell line (Waechter and Baserga, Proc. Natl. 
Acad. Sci. USA 79:1106-1110 (1982)) and the BHK 570 cell 
line (deposited with the American Type Culture 
Collection, 12301 Parklawn Dr., Rockville, MD, under 
accession number CRL 10314). A tk- BHK cell line is 
available from the ATCC under accession number CRL 1632. 

Mammalian expression vectors for use in 
carrying out the present invention will include a 
promoter capable of directing the transcription of a 
cloned gene or cDNA. Preferred promoters include viral 
promoters and cellular promoters. Viral promoters 
include the immediate early cytomegalovirus promoter 
(Boshart et al., Cell 41:521-530, 1985) and the SV40 
promoter (Subramani et al., Mol. Cell. Biol. 1:854-864, 
1981). Cellular promoters include the mouse 
metallothionein-1 promoter (Palmiter et al., U.S. Patent 
No. 4,579,821), a mouse Vκ promoter (Bergman et al., 
Proc. Natl. Acad. Sci. USA 81:7041-7045, 1983; Grant et 
al., Nuc. Acids Res. 15:5496, 1987) and a mouse VH 
promoter (Loh et al., Cell 33:85-93, 1983). A 
particularly preferred promoter is the major late 
promoter from Adenovirus 2 (Kaufman and Sharp, Mol. Cell. 
Biol. 2:1304-1319, 1982). Such expression vectors may 
also contain a set of RNA splice sites located downstream 
from the promoter and upstream from the DNA sequence 
encoding the peptide or protein of interest. Preferred 
RNA splice sites may be obtained from adenovirus and/or 
immunoglobulin genes. 

Also contained in the expression vectors is a 
polyadenylation signal located downstream of the coding 
sequence of interest. Polyadenylation signals include 
the early or late polyadenylation signals from SV40 
(Kaufman and Sharp, ibid.), the polyadenylation signal 
from the Adenovirus 5 E1B region and the human growth 
hormone gene terminator (DeNoto et al., Nuc. Acids Res. 
9:3719-3730, 1981). The expression vectors may include 
a noncoding viral leader sequence, such as the Adenovirus 
2 tripartite leader, located between the promoter and the 
RNA splice sites. Preferred vectors may also include 
enhancer sequences, such as the SV40 enhancer and the 
mouse μ enhancer (Gillies, Cell 33:717-728, 1983). 
Expression vectors may also include sequences encoding 
the adenovirus VA RNAs. 

Cloned DNA sequences may be introduced into 
cultured mammalian cells by, for example, calcium 
phosphate-mediated transfection (Wigler et al., Cell 14: 
725, 1978; Corsaro and Pearson, Somatic Cell Genetics 7: 
603, 1981; Graham and Van der Eb, Virology 52:456, 
1973). Other techniques for introducing cloned DNA 
sequences into mammalian cells, such as electroporation 
(Neumann et al., EMBO J. 1:841-845, 1982), may also be 
used. In order to identify cells that have integrated 
the cloned DNA, a selectable marker is generally 
introduced into the cells along with the gene or cDNA of 
interest. Preferred selectable markers for use in 
cultured mammalian cells include genes that confer 
resistance to drugs, such as neomycin, hygromycin, and 
methotrexate. The selectable marker may be an 
amplifiable selectable marker. Preferred amplifiable 
selectable markers are the DHFR gene and the neomycin 
resistance gene. Selectable markers are reviewed by 
Thilly (Mammalian Cell Technology, Butterworth 
Publishers, Stoneham, MA, which is incorporated herein by 
reference). The choice of selectable markers is well 
within the level of ordinary skill in the art. 

Selectable markers may be introduced into the 
cell on a separate plasmid at the same time as the gene 
of interest, or they may be introduced on the same 
plasmid. If on the same plasmid, the selectable marker 
and the gene of interest may be under the control of 
different promoters or the same promoter, the latter 
arrangement producing a dicistronic message. Constructs 
of this type are known in the art (for example, Levinson 
and Simonsen, U.S. Patent No. 4,713,339). It may also be 
advantageous to add additional DNA, known as "carrier 
DNA" to the mixture which is introduced into the cells. 

Transfected mammalian cells are allowed to grow 
for a period of time, typically 1-2 days, to begin 
expressing the DNA sequence(s) of interest. Drug 
selection is then applied to select for growth of cells 
that are expressing the selectable marker in a stable 
fashion. Transfected cells may also be selected in the 
presence of antagonist to inhibit the activity of the 
receptor. Suitable antagonists in this context include 
D,L-2-amino-3-phosphonopropionate. For cells that have 
been transfected with an amplifiable selectable marker 
the drug concentration may be increased in a stepwise 
manner to select for increased copy number of the cloned 
sequences, thereby increasing expression levels. 

Promoters, terminators and methods suitable for 
introducing expression vectors encoding recombinant GluGR 
into plant, avian and insect cells are known in the art. 
The use of baculoviruses, for example, as vectors for 
expressing heterologous DNA sequences in insect cells has 
been reviewed by Atkinson et al. (Pestic. Sci. 28: 
215-224, 1990). The use of Agrobacterium rhizogenes as 
vectors for expressing genes in plant cells has been 
reviewed by Sinkar et al. (J. Biosci. (Bangalore) 11: 
47-58, 1987). 

Host cells containing DNA constructs of the 
present invention are then cultured to produce 
recombinant GluGR. The cells are cultured according to 
accepted methods in a culture medium containing nutrients 
required for growth of mammalian or other host cells. A 
variety of suitable media are known in the art and 
generally include a carbon source, a nitrogen source, 
essential amino acids, vitamins, minerals and growth 
factors. The growth medium will generally select for 
cells containing the DNA construct by, for example, drug 
selection or deficiency in an essential nutrient which is 
complemented by the selectable marker on the DNA 
construct or co-transfected with the DNA construct. 

Transfected cells expressing a cloned GluGR can 
be detected by several methods. By transfecting cells 
with an expression vector containing expression units for 
both the GluGR and a reporter gene (e.g. luciferase), the 
activity of the reporter gene provides an indicator of 
expression of the cotransfected GluGR clone. By including 
one or more cyclic AMP response elements (CRE) in the 
reporter gene expression unit, clones encoding receptors 
coupled to either the stimulation or inhibition of the 
second messenger adenylate cyclase can be identified by a 
change in reporter gene expression in response to added 
ligand. DNA constructs comprising a linked CRE and 
reporter gene are known in the art. See, for example, 
Mellon et al., Proc. Natl. Acad. Sci. USA 86:4887-4891 
(1989), incorporated herein by reference. Cell lines 
expressing functional receptors can also be detected by 
electrophysiological measurements of agonist-induced 
channel activity. Receptor activity can also be assayed 
by measuring cytosolic free calcium concentrations in 
transfected cells. See, for example, Thastrup et al., 
Proc. Natl. Acad. Sci. USA 87:2466-2470 (1990) and 
Picard et al., Science 247:327-329 (1990), which are 
incorporated herein by reference. A preferred method for 
measuring cytosolic free calcium is by scanning cells 
with a fluorescent microscope coupled to a video camera. 
The cells are injected with a fluorescent Ca2+ indicator 
(e.g. Fluo-3 or Fura-2, Molecular Probes, Inc., Eugene, 
OR) and exposed to ligand. 

The GluGR produced according to the present 
invention may be purified from the recombinant expression 
systems or other sources using purification protocols 
that employ techniques generally available to those 
skilled in the art. The most convenient sources for 
obtaining large quantities of GluGR are cells which 
express the recombinant receptor. However, other 
sources, such as tissues, particularly brain tissues of 
the cerebellum which contain GluGR, may also be employed. 

Purification may be achieved by conventional 
chemical purification means, such as liquid 
chromatography, lectin affinity chromatography, gradient 
centrifugation, and gel electrophoresis, among others. 
Methods of protein purification are known in the art (see 
generally, Scopes, R., Protein Purification, 
Springer-Verlag, NY (1982), which is incorporated herein 
by reference) and may be applied to the purification of 
the GluGR and particularly the recombinantly produced 
GluGR described herein. In a preferred embodiment 
immunoaffinity chromatography is employed using 
antibodies directed against GluGR as herein described. In 
another method of purification, a recombinant gene 
encoding GluGR or portions thereof can be modified at the 
amino terminus, just behind a signal peptide, with a 
sequence coding for a small hydrophilic peptide, such as 
described in U.S. Patent Nos. 4,703,004 and 4,782,137, 
incorporated herein by reference. Specific antibodies 
for the peptide facilitate rapid purification of GluGR, 
and the short peptide can then be removed with 
enterokinase. 

Thus, as discussed above, the present invention 
provides GluGR isolated from its natural cellular 
environment, substantially free of other G protein- 
coupled glutamate receptors. Purified GluGR is also 
provided. Substantially pure GluGR of at least about 50% 
is preferred, at least about 70-80% more preferred, and 
95-99% or more homogeneity most preferred, particularly 
for pharmaceutical uses. Once purified, partially or to 
homogeneity, as desired, the recombinant GluGR or native 
GluGR may then be used to generate antibodies, in assay 
procedures, etc. 

In another aspect, the invention concerns 
polypeptides and fragments of GluGR. Polypeptides and 
fragments of GluGR may be isolated from recombinant 
expression systems or may be synthesized by the solid 
phase method of Merrifield, Fed. Proc. 21:412 (1962), 
Merrifield, J. Am. Chem. Soc. 85:2149 (1963), or Barany 
and Merrifield, in The Peptides, vol. 2, pp. 1-284 (1979) 
Academic Press, NY, each of which is incorporated herein 
by reference, or by use of an automated peptide 
synthesizer. By "polypeptides" is meant a sequence of at 
least about 3 amino acids, typically 6 or more, up to 
100-200 amino acids or more, including entire proteins. 
For example, the portion(s) of GluGR proteins which bind 
ligand may be identified by a variety of methods, such as 
by treating purified receptor with a protease or a 
chemical agent to fragment it and determining which 
fragment is able to bind to labeled glutamate in a ligand 
blot. Polypeptides may then be synthesized and used as 
antigen, to inhibit ligand-GluGR interaction, etc. It 
should be understood that as used herein, reference to 
GluGR is meant to include the proteins, polypeptides, and 
fragments thereof unless the context indicates otherwise. 

In another aspect, the invention provides means 
for regulating the GluGR-ligand interaction, and thus 
treating, therapeutically and/or prophylactically, a 
disorder which can be linked directly or indirectly to a 
GluGR or to its ligands, such as glutamate and other 
endogenous excitatory amino acids. By virtue of having 
the receptors of the invention, agonists or antagonists 
may be identified which stimulate or inhibit the 
interaction of ligand with a GluGR. With either agonists 
or antagonists the metabolism and reactivity of cells 
which express the receptor are controlled, thereby 
providing a means to abate or in some instances prevent 
the disease of interest. 

Thus, the invention provides screening 
procedures for identifying agonists or antagonists of 
events mediated by the ligand-GluGR interaction. Such 
screening assays may employ a wide variety of formats, 
depending to some extent on which aspect of the 
ligand/receptor/G protein interaction is targeted. For 
example, such assays may be designed to identify 
compounds which bind to the receptor and thereby block or 
inhibit interaction of the receptor with the ligand. 
Other assays can be designed to identify compounds which 
can substitute for ligand and therefore stimulate GluGR- 
mediated intracellular pathways. Yet other assays can be 
used to identify compounds which inhibit or facilitate 
the association of GluGR to G protein and thereby mediate 
the cellular response to GluGR ligand. 

In one functional screening assay, the 
initiation of fertilization activation events is 
monitored in eggs which have been injected with, e.g., 
mRNA which codes for GluGR and subsequently exposed to 
selected compounds which are being screened, in 
conjunction with or apart from an appropriate ligand. 
See generally, Kline et al., Science 241:464-467 (1988), 
incorporated herein by reference. Oocytes injected with 
mRNA coding for GluGR can also be assayed by measurement 
of free cytosolic Ca2+ as described above. 

Another screening assay is based on the use of 
mammalian cell lines which express GluGR functionally 
coupled to a mammalian G protein. In this assay, 
compounds are screened for their relative affinity as 
receptor agonists or antagonists by comparing the 
relative receptor occupancy to the extent of ligand 
induced stimulation or inhibition of second messenger 
metabolism. For example, activation of phospholipase C 
leads to increased inositol monophosphate metabolism. 
Means for measuring inositol monophosphate metabolism are 
generally described in Subers and Nathanson, J. Mol. 
Cell. Cardiol. 20:131-140 (1988), incorporated herein by 
reference. As noted previously, receptor subtypes that 
are coupled to the stimulation or inhibition of the 
second messenger adenylate cyclase can be used in assay 
systems wherein reporter gene (e.g. luciferase) activity 
is linked to receptor-ligand interactions. 

The screening procedure can be used to identify 
reagents such as antibodies which specifically bind to 
the receptors and substantially affect their interaction 
with ligand, for example. The antibodies may be 
monoclonal or polyclonal, in the form of antiserum or 
monospecific antibodies, such as purified antiserum or 
monoclonal antibodies or mixtures thereof. For 
administration to humans, e.g., as a component of a 
composition for in vivo diagnosis or imaging, the 
antibodies are preferably substantially human to minimize 
immunogenicity and are in substantially pure form. By 
substantially human is meant generally containing at 
least about 70% human antibody sequence, preferably at 
least about 80% human, and most preferably at least about 
90-95% or more of a human antibody sequence to minimize 
immunogenicity in humans. 

Antibodies which bind GluGR may be produced by a
variety of means. The production of non-human antisera
or monoclonal antibodies, e.g., murine, lagomorpha,
equine, etc. is well known and may be accomplished by,
for example, immunizing the animal with the receptor
molecule or a preparation containing a desired portion of
the receptor molecule, such as that domain or domains
which contribute to ligand binding. Receptor
subtype-specific antibodies can be generated by
immunizing with specific peptides. Small peptides (e.g.,
about 14-20 amino acids) can be coupled to keyhole limpet
hemocyanin, for example, to enhance immunogenicity. For
the production of monoclonal antibodies, antibody
producing cells obtained from immunized animals are
immortalized and screened, or screened first for the
production of antibody which binds to the receptor
protein and then immortalized. As the generation of human
monoclonal antibodies to human GluGR antigen may be
difficult with conventional techniques, it may be
desirable to transfer antigen binding regions of the
non-human antibodies, e.g. the F(ab')2 or hypervariable
regions, to human constant regions (Fc) or framework
regions by recombinant DNA techniques to produce
substantially human molecules. Such methods are generally
known in the art and are described in, for example, U.S.
Patent No. 4,816,397 and EP publications 173,494 and
239,400, which are incorporated herein by reference.
Alternatively, one may isolate DNA sequences which code
for a human monoclonal antibody or portions thereof that
specifically bind to the human receptor protein by
screening a DNA library from human B cells according to
the general protocol outlined by Huse et al., Science
246:1275-1281 (1989), incorporated herein by reference,
and then cloning and amplifying the sequences which
encode the antibody (or binding fragment) of the desired
specificity.

In other embodiments, the invention provides
screening assays conducted in vitro with cells which
express the receptor. For example, the DNA which encodes
the receptor or selected portions thereof may be
transfected into an established cell line, e.g., a
mammalian cell line such as BHK or CHO, using procedures
known in the art (see, e.g., Sambrook et al., Molecular
Cloning, A Laboratory Manual, 2d ed., Cold Spring Harbor
Laboratory Press, Cold Spring Harbor, N.Y., 1989, which
is incorporated herein by reference). The receptor is
then expressed by the cultured cells, and selected agents
are screened for the desired effect on the cell,
separately or in conjunction with an appropriate ligand
such as glutamate or quisqualate. Means for amplifying
nucleic acid sequences which may be employed to amplify
sequences encoding the receptor or portions thereof are
described in U.S. Pat. Nos. 4,683,195 and 4,683,202,
incorporated herein by reference.

In yet another aspect, the screening assays
provided by the invention relate to transgenic mammals
whose germ cells and somatic cells contain a nucleotide
sequence encoding GluGR protein or a selected portion of
the receptor which, e.g., binds ligand, GTP binding
protein, or the like. There are several means by which a
sequence encoding, for example, the human GluGR may be
introduced into a non-human mammalian embryo, some of
which are described in, e.g., U.S. Patent No. 4,736,866,
Jaenisch, Science 240:1468-1474 (1988) and Westphal et
al., Annu. Rev. Cell Biol. 5:181-196 (1989), which are
incorporated herein by reference. The animal's cells
then express the receptor and thus may be used as a
convenient model for testing or screening selected
agonists or antagonists.

In another aspect the invention concerns
diagnostic methods and compositions. By means of having
the GluGR molecule and antibodies thereto, a variety of
diagnostic assays are provided. For example, with
antibodies, including monoclonal antibodies, to GluGR, the
presence and/or concentration of receptor in selected
cells or tissues in an individual or culture of interest
may be determined. These assays can be used in the
diagnosis and/or treatment of diseases such as, for
example, cerebral ischemia, Parkinson's, senile dementia
and other cognitive disorders, Huntington's chorea,
amyotrophic lateral sclerosis, emesis, migraine, and
others.

Numerous types of immunoassays are available
and are known to those skilled in the art, e.g.,
competitive assays, sandwich assays, and the like, as
generally described in, e.g., U.S. Pat. Nos. 4,642,285;
4,376,110; 4,016,043; 3,879,262; 3,852,157; 3,850,752;
3,839,153; 3,791,932; and Harlow and Lane, Antibodies, A
Laboratory Manual, Cold Spring Harbor Publications, N.Y.
(1988), each incorporated by reference herein. In one
assay format GluGR is identified and/or quantified by
using labeled antibodies, preferably monoclonal
antibodies, which are reacted with brain tissues, e.g.,
cortex, striatum, hippocampus, cerebellum, and
determining the specific binding thereto, the assay
typically being performed under conditions conducive to
immune complex formation. Unlabeled primary antibody can
be used in combination with labels that are reactive with
primary antibody to detect the receptor. For example,
the primary antibody may be detected indirectly by a
labeled secondary antibody made to specifically detect
the primary antibody. Alternatively, the anti-GluGR
antibody can be directly labeled. A wide variety of
labels may be employed, such as radionuclides, particles
(e.g., gold, ferritin, magnetic particles, red blood
cells), fluorophores, chemiluminescers, enzymes, enzyme
substrates, enzyme cofactors, enzyme inhibitors, ligands
(particularly haptens), etc.

The GluGR DNA may be directly detected in cells
with a labeled GluGR DNA or synthetic oligonucleotide
probe in a hybridization procedure similar to the
Southern or dot blot. Also, the polymerase chain
reaction (Saiki et al., Science 239:487 (1988), and U.S.
Pat. No. 4,683,195) may be used to amplify DNA sequences,
which are subsequently detected by their characteristic
size on agarose gels, Southern blots of these gels using
GluGR DNA or an oligonucleotide probe, or a dot blot using
similar probes. The probes may comprise from about 14
nucleotides to about 25 or more nucleotides, preferably,
40 to 60 nucleotides, and in some instances a substantial
portion or even the entire cDNA of GluGR may be used. The
probes are labeled with a detectable signal, such as an
enzyme, biotin, a radionuclide, fluorophore,
chemiluminescer, paramagnetic particle, etc.

Kits can also be supplied, for use with the 
receptor of the subject invention in the detection of the 
presence of the receptor or antibodies thereto, as might 
be desired in the case of autoimmune disease. Thus, 
antibodies to GluGR, preferably monospecific antibodies
such as monoclonal antibodies, or compositions of the 
receptor may be provided, usually in lyophilized form in 
a container, either segregated or in conjunction with 
additional reagents, such as anti-antibodies, labels, 
gene probes, polymerase chain reaction primers and 
polymerase, and the like. 

The following examples are offered by way of 
illustration, not by limitation.



Preparation of GluGR enriched mRNA

Total RNA was prepared from the cerebellum of
rats using guanidine isothiocyanate (Chirgwin et al.
Biochemistry 18:52-94 (1979)) and CsCl centrifugation.
Poly(A)+ RNA was isolated using oligo d(T) cellulose
chromatography. After 2 rounds of chromatography on
oligo d(T) cellulose the RNA (800 µg) was divided into
two aliquots and layered over 10-40% linear sucrose
gradients in tubes for an SW 28 rotor. The gradients
were centrifuged for 28 hours at 25,000 rpm to pellet RNA
greater than 4 kb in size. The enriched RNA was injected
into frog oocytes and assayed for the presence of the
GluGR.

Injection of oocytes and voltage-clamp assay of GluGR

Oocytes were prepared from ovarian lobes that
were surgically removed from anesthetized Xenopus
females. The ovarian lobes were washed, pulled apart
into small clumps and dissociated by treatment with
collagenase for 2-3 hours at 20°C with constant, gentle
agitation. The dissociation and defolliculation of
the oocytes was completed manually after removal of the
collagenase. Oocytes that were judged healthy and
greater than 1 mm in diameter were transferred to a 50 mm
sterile tissue culture dish and incubated in sterile,
antibiotic-supplemented Barth's medium (88 mM NaCl, 1 mM
KCl, 0.82 mM MgSO4, 0.33 mM Ca(NO3)2, 0.41 mM CaCl2, 2.4 mM
NaHCO3, 10 mM HEPES, pH 7.4, 0.1 mg/ml gentamicin, 0.01
mg/ml penicillin, 0.01 mg/ml streptomycin, 0.5 mM
theophylline, and 2.5 mM Na pyruvate) at 19°C.
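
As a rough aid for making up the medium, the millimolar concentrations above can be converted to grams per liter. The sketch below covers only four components. The formula weights are standard values for the anhydrous compounds and are assumptions added here, not figures from the text. The calcium and magnesium salts are left out because the text does not say which hydrate was used.

```python
# Illustrative conversion of Barth's medium concentrations (mM) to g/L.
# Formula weights (g/mol) are assumed standard values, not from the text.
FORMULA_WEIGHT = {"NaCl": 58.44, "KCl": 74.55, "NaHCO3": 84.01, "HEPES": 238.30}

# Concentrations in mM as listed for Barth's medium.
BARTHS_MM = {"NaCl": 88.0, "KCl": 1.0, "NaHCO3": 2.4, "HEPES": 10.0}


def grams_per_liter(conc_mm, formula_weight):
    """Convert a millimolar concentration to grams per liter."""
    return conc_mm / 1000.0 * formula_weight


def recipe(volume_l=1.0):
    """Return grams of each listed component for the requested volume."""
    return {
        name: grams_per_liter(mm, FORMULA_WEIGHT[name]) * volume_l
        for name, mm in BARTHS_MM.items()
    }


if __name__ == "__main__":
    for name, grams in recipe(1.0).items():
        print(f"{name}: {grams:.3f} g per liter")
```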

Injection pipettes were pulled from hard glass
tubing (Drummond) on a modified 700C Kopf vertical
puller. The tip was broken and bevelled using a List
Medical microforge. Tip diameters of the pipettes ranged
from 20-30 µm. Injection pipettes were made RNase free
by heating to 285°C overnight.

Following overnight incubation, healthy oocytes
were selected for injection. RNA, which was stored at
-70°C in DEPC-treated water, was thawed and centrifuged
at 15,000 g for five minutes. Injection was performed
using a modified pipetting device (Drummond). After
injection, the oocytes were incubated in fresh, sterile
Barth's medium which was changed daily, and unhealthy
oocytes were removed.

Voltage-clamp assays were carried out on
injected oocytes which were each placed in a small
chamber of approximately 500 µl in volume which was
continuously perfused with standard frog Ringer's (115 mM
NaCl, 2.5 mM KCl, 1.8 mM CaCl2, 10 mM HEPES, pH 7.2) at
1-6 ml/min. The oocyte was impaled with two glass
microelectrodes for recording which, when filled with 3 M
KCl, had a tip resistance of 0.5 to 7.0 megaohms. One of
the two electrodes was connected to a differential
amplifier via a silver/silver chloride half cell. The
bath potential was measured by connecting the other side
of the differential amplifier to the bath via a
silver/silver chloride pellet and a Ringer/Agar bridge.
A low noise, high compliance, voltage-clamp system (NPI)
was used to control the membrane potential and to measure
membrane current. The oocyte membrane potential was
maintained at -60 mV (inside cell negative). One
millimolar glutamate (Sigma), 100 µM quisqualate (Sigma),
1 mM carbamylcholine (Sigma), and the other drugs used in
this assay were applied by switching the perfusing medium
to a medium containing a drug for approximately three
minutes, and the membrane current was recorded on a chart
recorder (Linear Instruments).

After impaling the oocyte with the two
microelectrodes, and imposing the voltage-clamp, the
membrane current (the holding current) gradually declines
to a steady state over a period of several minutes. When
the holding current stabilizes, so that the chart record
is horizontal, the drug is applied for one to three
minutes. An oocyte is judged to have a positive response
if a rapid inward current spike (downward deflection on
the chart), followed by slow current oscillations of
decreasing magnitude, is observed. Our lower limit of
detection depended on the steadiness of the holding
current prior to drug application, but was in the range
of 5-10 nA.
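
The amplitude part of this scoring rule can be sketched in code. This is only an illustration, since the responses described here were read by eye from a chart record. The function name, the sign convention (inward current more negative than the holding current) and the default 10 nA threshold are assumptions for the example. The sketch does not test for the slow oscillations of decreasing magnitude that follow the spike.

```python
# Hypothetical helper: flags a trace whose largest inward deflection from
# the holding current reaches the detection limit. It does not check for
# the later oscillations, so it covers only the amplitude criterion.
def is_positive_response(holding_na, trace_na, detection_limit_na=10.0):
    """Return True if the peak inward current meets the detection limit.

    holding_na: steady holding current before drug application (nA).
    trace_na: current samples recorded during drug application (nA).
    Inward current is taken as more negative than the holding current.
    """
    peak_inward = holding_na - min(trace_na)  # positive for inward deflection
    return peak_inward >= detection_limit_na


if __name__ == "__main__":
    responder = [-50.0, -120.0, -70.0, -90.0, -60.0]
    silent = [-50.0, -53.0, -51.0]
    print(is_positive_response(-50.0, responder))
    print(is_positive_response(-50.0, silent))
```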

Construction of pVEGT'

To permit transcription of cloned cDNA without
prior endonuclease digestion, bacteriophage T7
transcriptional terminators were added to a cloning
vector. Plasmid pVEGT' is described in copending
U.S.S.N. 07/581,342, which is incorporated by reference
herein. The sequence of the putative T7 RNA
transcription terminator, which lies between gene 10 and
gene 11 of bacteriophage T7, is disclosed by Dunn and
Studier (J. Mol. Biol. 166:477-536 (1983)). As shown in
Figure 5, four synthetic oligonucleotides were designed
from this sequence and ligated into the vector pGEM-1
(obtained from Promega Biotec, Madison, WI), a plasmid
containing a bacterial origin of replication, ampicillin
resistance gene, and the T7 promoter adjacent to a
multiple cloning site. Terminal phosphates were added to
the 5' ends of oligonucleotides ZC776 and ZC777 (Sequence
ID Nos. 4 and 5) with T4 polynucleotide kinase and ATP,
under standard conditions (Maniatis et al. ibid). (The
sequences of these and other oligonucleotides referred to
herein are shown in Table 1.) After the incubation, the
kinase was heat killed at 65°C for 10 min. Twenty-five
nanograms of oligonucleotide ZC775 (Sequence ID Number 3)
and 25 ng of oligonucleotide ZC776 (Sequence ID Number 4)
were annealed by incubation at 65°C for 15 minutes, then
allowed to cool to room temperature in 500 ml of water.
Oligonucleotides ZC777 and ZC778 (Sequence ID Nos. 5 and
6) were similarly annealed. The annealed
oligonucleotides were stored at -20°C until use. The
vector pGEM-1 was digested with Pst I and Hind III, and
the linearized vector DNA was purified by agarose gel
electrophoresis. The synthetic T7 terminator (annealed
oligonucleotides ZC775, ZC776, ZC777 and ZC778; Sequence
ID Nos. 3, 4, 5 and 6) was then cloned into pGEM-1.
Twenty-five nanograms of vector plus an equimolar
amount of each of the annealed oligonucleotides
ZC775/ZC776 (Sequence ID Nos. 3 and 4) and ZC777/ZC778
(Sequence ID Nos. 5 and 6) were combined in a 10 µl
reaction mix. After an overnight ligation at 14°C, the
DNA was transformed into competent E. coli JM83 cells,
and the transformed cells were selected for ampicillin
resistance. Plasmid DNA was prepared from selected
transformants by the alkaline lysis procedure (Birnboim
and Doly, Nuc. Acids Res. 7:1513-1523 (1979)). A portion
of the DNA from these samples was cut with Pst I and Hind
III and analyzed on a 4% polyacrylamide gel to identify
clones that released an 80 bp Pst I-Hind III fragment.
Other diagnostic cuts, such as Eco RI and Not I, were
also made. One of the isolates, designated pGEMT, was
shown by restriction analysis to contain the T7
terminator fragment.
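
The pairing of the four terminator oligonucleotides can be checked from their Table 1 sequences. The sketch below is an illustrative check, not part of the described procedure. It confirms that ZC776 is the reverse complement of ZC775 flanked by CTCA and TGCA, and that ZC778 is AGCT followed by the reverse complement of ZC777 without its first four bases. Reading TGCA and AGCT as ends compatible with the Pst I and Hind III cut vector is an interpretation added here.

```python
# Illustrative complementarity check on the Table 1 terminator oligos.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq):
    """Return the reverse complement of a DNA sequence (5' to 3')."""
    return seq.translate(COMPLEMENT)[::-1]


# Sequences from Table 1, written 5' to 3' with the spacing removed.
ZC775 = "GCTAGCATAACCCCTTGGGGCCTCTAAACGGGTCT"
ZC776 = "CTCAAGACCCGTTTAGAGGCCCCAAGGGGTTATGCTAGCTGCA"
ZC777 = "TGAGGGGTTTTTTGCTGAAAGGAGGAACTATGCGGCCGCA"
ZC778 = "AGCTTGCGGCCGCATAGTTCCTCCTTTCAGCAAAAAACCC"

# ZC776 pairs with all of ZC775 and carries a 4-base extension at each end.
pair_a_ok = ZC776 == "CTCA" + revcomp(ZC775) + "TGCA"

# ZC778 pairs with ZC777 except for the first four bases of ZC777 (TGAG).
pair_b_ok = ZC778 == "AGCT" + revcomp(ZC777[4:])

# The free TGAG end of ZC777 is complementary to the CTCA end of ZC776,
# which is what would let the two duplexes join to each other.
junction_ok = revcomp(ZC777[:4]) == ZC776[:4]

if __name__ == "__main__":
    print(pair_a_ok, pair_b_ok, junction_ok)
```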

Table 1

Oligonucleotide Sequences (5' - 3')

ZC775 (Sequence ID Number 3):
GCT AGC ATA ACC CCT TGG GGC CTC TAA ACG GGT CT

ZC776 (Sequence ID Number 4):
CTC AAG ACC CGT TTA GAG GCC CCA AGG GGT TAT GCT AGC TGC A

ZC777 (Sequence ID Number 5):
TGA GGG GTT TTT TGC TGA AAG GAG GAA CTA TGC GGC CGC A

ZC778 (Sequence ID Number 6):
AGC TTG CGG CCG CAT AGT TCC TCC TTT CAG CAA AAA ACC C

ZC1751 (Sequence ID Number 7):
AAT TCT GTG CTC TGT CAA G

ZC1752 (Sequence ID Number 8):
GAT CCT TGA CAG AGC ACA G

ZC2063 (Sequence ID Number 9):
GAT CCA AAC TAG TAA AAG AGC T

ZC2064 (Sequence ID Number 10):
CTT TTA CTA GTT TG

ZC2938 (Sequence ID Number 11):
GAC AGA GCA CAG ATT CAC TAG TGA GCT CTT TTT TTT TTT TTT T

ZC3015 (Sequence ID Number 12):
TTC CAT GGC ACC GTC AAG GCT

ZC3016 (Sequence ID Number 13):
AGT GAT GGC ATG GAC TGT GGT

ZC3652 (Sequence ID Number 14):
ACA TGC ACC ATG CTC TGT GT

ZC3654 (Sequence ID Number 15):
AGT GAT GGC ATG GAC TGT GGT

The native T7 terminator from plasmid pAR2529
(Rosenberg et al., Gene 56:125-135 (1987)) was added to
plasmid pGEMT. Plasmid pGEMT was digested with Bam HI and
plasmid pAR2529 was digested with Bam HI and Bgl II
(Figure 1). The Bam HI-Bgl II terminator fragment from
pAR2529 was purified by agarose gel electrophoresis. The
terminator fragment was ligated to Bam HI digested pGEMT,
and the DNA was transformed into competent E. coli LM1035
cells. Colonies that were ampicillin resistant were
inoculated into 5 ml cultures for overnight growth.
Plasmid DNA prepared by the alkaline lysis procedure was
screened for proper terminator orientation by Bam HI-Sal
I digestion and electrophoresis on an 8% polyacrylamide
gel. A clone that contained the terminator in the
correct orientation, as evidenced by the presence of a
130 bp Bam HI-Sal I fragment, was chosen and named pGEMTT
(Figure 1).

To allow pGEMTT to be packaged as single-stranded
DNA in the presence of M13 phage proteins, the M13
intergenic region from pUC382 (similar to pUC118 and 119
as disclosed by Vieira and Messing, Methods Enzymol.
153:3-11 (1987)) was added to pGEMTT (Figure 1). Plasmid
pGEMTT was digested with Fsp I and Nar I, and the
fragment containing the T7 promoter and transcription
terminator was purified. Plasmid pUC382 was digested
with Fsp I and Nar I, and the fragment encoding the
ampicillin resistance gene and the M13 intergenic region
was gel purified. These fragments were then ligated
together in the presence of T4 DNA ligase. The ligated
DNA was transformed into competent E. coli LM1035 cells.
Plasmid DNA from twelve ampicillin-resistant colonies was
prepared by the alkaline lysis method, and the DNA was
screened by digestion with Ava I. The appropriate
construction gave two bands, one of 2430 bp and another
of 709 bp. One such isolate was chosen and named pVEG.
Synthetic oligonucleotides encoding the prime sequence
were added to pVEG between the Bam HI and Eco RI sites
(Figure 1). Plasmid pVEG was digested with Bam HI and
Eco RI and the vector fragment was gel purified.
Ninety-six nanograms each of oligonucleotides ZC1751 and
ZC1752 (Sequence ID Nos. 7 and 8) were annealed in 4.5 µl
of 10 mM Tris pH 7.5, 20 mM MgCl2 and 10 mM NaCl at 65°C
for 20 minutes, then the mixture was cooled to room
temperature over a period of 30 minutes. The annealed
oligonucleotides were ligated to the pVEG vector fragment
with T4 DNA ligase and then transformed into competent E.
coli LM1035 cells. After growing overnight to develop
the colonies, a filter lift was taken of the colonies on
the agar plate. The filter was probed with 32P-labeled
oligonucleotide ZC1751 (Sequence ID Number 7). All of
the colonies were positive. Plasmid DNA was prepared
from cultures grown from 12 of the colonies. The plasmid
DNA was screened by digestion with Sst I to verify the
absence of the Sst I site between the Eco RI and Bam HI
sites of pVEG. All 12 of the plasmid DNAs were negative
for Sst I digestion. One of these 12 isolates was chosen
and named pVEG'.

A polyadenylate sequence derived from an
Aspergillus alcohol dehydrogenase cDNA was added to pVEG.
As shown in Figure 1, plasmid pM098 (disclosed in
published European patent application EP 272,277 and
deposited with American Type Culture Collection under
accession number 53428) was digested with Dra I and Bam
HI, and the approximately 150 bp poly(A) fragment was
purified by agarose gel electrophoresis. This fragment
contained mostly poly(A) sequence with very little
flanking cDNA. To clone the poly(A) cDNA fragment into
pVEG, pVEG was digested with Bam HI and Sma I, and the
3.4 kb vector fragment was gel purified. The vector and
poly(A) fragments were ligated together with T4 DNA
ligase to produce vector pVEGT (Figure 1).

Synthetic oligonucleotides encoding the prime
sequence were added to pVEGT. To accomplish this, pVEGT
was digested with Not I and Sst I, and the 370 bp
fragment containing the poly(A) sequence and the two T7
transcriptional terminators was purified by agarose gel
electrophoresis. Plasmid pVEG' was digested with Not I
and Bam HI, and the 3.2 kb vector fragment was
gel-purified. Two oligonucleotides (ZC2063 and ZC2064;
Sequence ID Nos. 9 and 10) that formed, when annealed, a
Bam HI-Sst I adapter were synthesized. The two
oligonucleotides were individually kinased and annealed,
and ligated with the linearized vector and the
poly(A)-terminator fragment. The resultant vector,
designated pVEGT' (Figure 1), contained a T7 RNA
transcription promoter, an Eco RI cloning site flanked by
the prime sequence, a poly(A) tract, and two T7 RNA
polymerase terminators.

Construction of a cDNA library from rat cerebellum poly(A)+ RNA

Because there was evidence suggesting that the
GluGR was encoded by a very large mRNA of 7 kb (Fong,
Davidson, and Lester, Synapse 2:657 (1988)) and because
full length cDNA encompassing the coding sequence is
required for functional cloning of cDNA, measures were
taken to optimize for synthesis of large cDNA. A novel
method of cDNA synthesis was developed which yielded
large full length cDNA. This was evidenced by the
demonstration that full length 7.5 kb cDNA could be
synthesized from a model 7.5 kb mRNA and that large full
length cDNA were present in a library constructed from
poly(A)+ RNA as demonstrated by Southern blot analysis.
In addition, all enzymes which were important in this
method were pretested and selected from a large number of
lots of enzymes available from commercial suppliers.
Once a satisfactory lot was identified, a large amount of
the enzyme was purchased and the enzyme was stored at
-70°C until used. Once used, the enzyme was stored at
-20°C for a few months and then discarded. Different
"lots" of enzymes from commercial suppliers, including
lots of Superscript reverse transcriptase (BRL), E. coli
DNA polymerase I (Amersham) and Mung bean nuclease (NEB),
which were used in the cDNA synthesis, were screened for
quality in test synthesis assays. Superscript reverse
transcriptase lots were assayed for the ability to
synthesize unit length (7.5 kb) first strand cDNA from
7.5 kb RNA (BRL) control. Conditions for first strand
synthesis with Superscript reverse transcriptase lots
were prepared as described below. Radiolabeled first
strand cDNA was analyzed by alkaline agarose gel
electrophoresis. Superscript lots capable of producing
unit length, 7.5 kb cDNA were selected for use.

E. coli DNA polymerase I lots were assayed for the
ability to produce, by hairpin DNA formation, full-length
second strand cDNA from the 7.5 kb unit-length first
strand cDNA. The second strand cDNA syntheses were
carried out as described below. The quality of the
second strand syntheses was assessed by alkaline agarose
electrophoresis of the radiolabeled product. DNA
polymerase I lots capable of producing 15 kb second
strand DNA from the 7.5 kb unit length first strand cDNA
were selected for use.

Mung bean nuclease lots were tested for the 
ability to clip the hairpin DNA formed during second 
strand synthesis without degrading the cDNA. In 
addition, varying concentrations of enzyme were added to 
determine the optimum enzyme concentration for the 
conditions set forth below. The reactions were assessed 
by alkaline agarose electrophoresis. Lots and 
concentrations resulting in the production of 7.5 kb unit 
length cDNA were selected for use. 

Total RNA was prepared from rat cerebella using
guanidine isothiocyanate (Chirgwin et al. Biochemistry
18:52-94 (1979)) and CsCl centrifugation (Glisin et al.
Biochemistry 13:2633-2637 (1974)). Poly(A)+ RNA was
selected from the total RNA using oligo d(T) cellulose
chromatography (Aviv and Leder, Proc. Natl. Acad. Sci.
USA 69:1408 (1972)).

First strand cDNA was synthesized from one time
poly d(T)-selected cerebellum poly(A)+ RNA in two
separate reactions. One reaction, containing
radiolabeled dATP, was used to assess the quality of
first strand synthesis. The second reaction was carried
out in the absence of radiolabeled dATP and was used, in
part, to assess the quality of second strand synthesis.
Superscript reverse transcriptase (BRL) was used
specifically as described below. A 2.5x reaction mix was
prepared at room temperature by mixing, in order, 10 µl
of 5x reverse transcriptase buffer (BRL; 250 mM Tris-HCl
pH 8.3, 375 mM KCl, and 15 mM MgCl2), 2.5 µl of 200 mM
dithiothreitol (made fresh or stored in aliquots at
-70°C) and 2.5 µl of a deoxynucleotide triphosphate
solution containing 10 mM each of dATP, dGTP, dTTP and
5-methyl dCTP (Pharmacia). The reaction mix was
aliquoted into two tubes of 7.5 µl each. To the first
tube, 1.3 µl of 10 µCi/µl α-32P-dATP (Amersham) was added
and 1.3 µl of water was added to the second reaction
tube. Seven microliters from each tube was transferred
to reaction tubes. Fourteen microliters of a solution
containing 10 µg of cerebellum poly(A)+ RNA diluted in 14
µl of 5 mM Tris-HCl pH 7.4, 50 mM EDTA was mixed with 2
µl of 1 µg/µl first strand primer, ZC2938 (Table 1;
Sequence ID No. 11), and the primer was annealed to the
RNA by heating the mixture to 65°C for 4 minutes,
followed by chilling in ice water. Eight microliters of
the RNA-primer mixture was added to each of the two
reaction tubes followed by 5 µl of 200 U/µl Superscript
reverse transcriptase (BRL). The reactions were mixed
gently, and the tubes were incubated at 45°C for 30
minutes. After incubation, 80 µl of 10 mM Tris-HCl pH
7.4, 1 mM EDTA was added to each tube, and the samples were
vortexed and centrifuged briefly. Three microliters of
each reaction was removed to determine total counts and
TCA precipitable counts (incorporated counts). Two
microliters of each sample was analyzed by alkaline gel
electrophoresis to assess the quality of first strand
synthesis. The remainder of each sample was ethanol
precipitated. The nucleic acids were pelleted by
centrifugation, washed with 80% ethanol and air dried for
ten minutes. The first strand synthesis yielded 1.4 µg
of cerebellum cDNA or a 28% conversion of RNA into DNA.
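
The volumes in this paragraph can be checked with a short calculation. The sketch below is a worked example only. It assumes the RNA-primer mixture held 10 µg of RNA in 16 µl, so that each reaction received 5 µg, which is the reading consistent with the stated 28% conversion. It also assumes a 20 µl final reaction (7 µl mix, 8 µl RNA-primer, 5 µl enzyme). The function and variable names are illustrative.

```python
# Worked check of the first strand reaction volumes (all volumes in µl).
MIX_TOTAL = 10.0 + 2.5 + 2.5        # 5x buffer + DTT + dNTP solution
ALIQUOT_DILUTION = 7.5 / (7.5 + 1.3)  # aliquot plus 1.3 µl label or water
REACTION_DILUTION = 7.0 / (7.0 + 8.0 + 5.0)  # mix into the assumed 20 µl


def final_concentration(stock, volume_in_mix):
    """Final concentration of a mix component, in the units of the stock."""
    in_mix = stock * volume_in_mix / MIX_TOTAL
    return in_mix * ALIQUOT_DILUTION * REACTION_DILUTION


buffer_x = final_concentration(5.0, 10.0)   # fold concentration of buffer
dtt_mm = final_concentration(200.0, 2.5)    # mM dithiothreitol
dntp_mm = final_concentration(10.0, 2.5)    # mM of each nucleotide

# Assumed RNA input: 10 µg in 16 µl of RNA-primer mixture, 8 µl per tube.
rna_per_reaction_ug = 10.0 * 8.0 / 16.0
conversion = 1.4 / rna_per_reaction_ug      # reported yield over input

if __name__ == "__main__":
    print(f"buffer {buffer_x:.2f}x, DTT {dtt_mm:.1f} mM, dNTP {dntp_mm:.2f} mM")
    print(f"RNA per reaction {rna_per_reaction_ug:.1f} ug, "
          f"conversion {conversion:.0%}")
```

Under these assumptions the buffer comes out close to 1x, dithiothreitol close to 10 mM and each nucleotide close to 0.5 mM in the final reaction.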

Second strand cDNA synthesis was performed on the
RNA-DNA hybrid from the first strand reactions under
conditions which encouraged first strand priming of
second strand synthesis resulting in DNA hairpin
formation. The nucleic acid pellets containing the first
strand cDNA were resuspended in 71 µl of water. To
assess the quality of second strand synthesis, α-32P-dATP
was added to the unlabeled first strand cDNA. To
encourage formation of the hairpin structure, all
reagents except the enzymes were brought to room
temperature, and the reaction mixtures were set up at
room temperature. (Alternatively, the reagents can be on
ice and the reaction mixture set up at room temperature
and allowed to equilibrate at room temperature for a
short time prior to incubation at 16°C.) Two reaction
tubes were set up for each synthesis. One reaction tube
contained the unlabeled first strand cDNA and the other
reaction tube contained the radiolabeled first strand
cDNA. To each reaction tube were added 20 µl of 5x second
strand buffer (100 mM Tris, pH 7.4, 450 mM KCl, 23 mM
MgCl2, 50 mM (NH4)2SO4), 3 µl of beta-NAD and 1 µl of a
deoxynucleotide triphosphate solution containing 10 mM
each of dATP, dGTP, dTTP and dCTP (Pharmacia), 1 µl
α-32P-dATP or 1 µl of water (the radiolabeled dATP was
added to the tube containing the unlabeled first strand
cDNA), 0.6 µl of 7 U/µl E. coli DNA ligase
(Boehringer-Mannheim), 3.1 µl of 8 U/µl E. coli DNA
polymerase I (Amersham), and 1 µl of 2 U/µl of RNase H
(BRL). The reactions were incubated at 16°C for 2 hours.
After incubation, 3 µl was taken from each reaction tube
to determine total and TCA precipitable counts. Two
microliters of each sample was analyzed by alkaline gel
electrophoresis to assess the quality of second strand
synthesis by the presence of a band of approximately
twice unit length. To the remainder of each sample, 2 µl
of 2.5 µg/µl oyster glycogen, 5 µl of 0.5 M EDTA and 200
µl of 10 mM Tris-HCl pH 7.4, 1 mM EDTA were added, the
samples were phenol-chloroform extracted, and isopropanol
precipitated. The nucleic acids were pelleted by
centrifugation, washed with 80% ethanol and air dried.
The yield of double stranded cDNA in each of the
reactions was approximately 2 µg.

The single-stranded DNA in the hairpin structure
was clipped using mung bean nuclease. Each second strand
DNA sample was resuspended in 12 µl of water. Two
microliters of 10x mung bean buffer (0.3 M NaOAc, pH 4.6,
3 M NaCl, 10 mM ZnSO4), 2 µl of 10 mM dithiothreitol, 2 µl
of 50% glycerol, and 2 µl of 10 U/µl mung bean nuclease
(NEB, lot 7) were added to each tube, and the reactions
were incubated at 30°C for 30 minutes. After incubation,
80 µl of 10 mM Tris-HCl pH 7.4, 1 mM EDTA was added to
each tube, and 2 µl of each sample was subjected to
alkaline gel electrophoresis to assess the cleavage of
the second strand product into unit length cDNA. One
hundred microliters of 1 M Tris-HCl pH 7.4 was added to
each sample, and the samples were twice extracted with
phenol-chloroform. Following the final phenol-chloroform
extraction, the DNA was isopropanol precipitated. The
DNA was pelleted by centrifugation, washed with 80%
ethanol and air dried. Approximately 2 µg of DNA was
obtained from each reaction.

The cDNA was blunt-ended with T4 DNA polymerase
after the cDNA pellets were resuspended in 12 µl of
water. Two microliters of 10x T4 DNA polymerase buffer
(330 mM Tris-acetate, pH 7.9, 670 mM KAc, 100 mM MgAc, 1
mg/ml gelatin), 2 µl of 1 mM dNTP, 2 µl of 50 mM
dithiothreitol, and 2 µl of 1 U/µl T4 DNA polymerase
(Boehringer-Mannheim) were added to each tube. After an
incubation at 15°C for 1 hour, 180 µl of 10 mM Tris-HCl
pH 7.4, 1 mM EDTA was added to each sample, and the
samples were phenol-chloroform extracted followed by
isopropanol precipitation. The cDNA was pelleted by
centrifugation, washed with 80% ethanol and air dried.
Eco RI adapters (Invitrogen, Cat. # N409-20) were ligated
to the blunted cDNA after the DNA from each reaction was
resuspended in 6.5 µl water.

The first strand primer encoded an Sst I cloning
site to allow the cDNA to be directionally cloned into an
expression vector. The cDNA was digested with Sst I
followed by phenol-chloroform extraction and isopropanol
precipitation. After digestion, the cDNA was
electrophoresed in a 0.8% low melt agarose gel, and the
cDNA over 4.2 kb was electroeluted using an Elutrap
(Schleicher and Schuell, Keene, NH). The electroeluted
cDNA in 500 µl of buffer was isopropanol precipitated and
the cDNA was pelleted by centrifugation. The cDNA pellet
was washed with 80% ethanol.

A cerebellum cDNA library was established by
ligating the cDNA to the Eco RI-Sst I digested, agarose
gel purified pVEGT'.

Ten sublibraries of one million clones each were
constructed, representing a library of ten million
independent clones. To prepare each sublibrary, 80 ng of
linearized vector was ligated to 40 ng of cDNA. After
incubation at room temperature for 11 hours, 2.5 µg of
oyster glycogen and 80 µl of 10 mM Tris-HCl, 1 mM EDTA
were added and the sample was phenol-chloroform extracted
followed by ethanol precipitation. The DNA was pelleted
by centrifugation, and the DNA pellet washed with 80%
ethanol. After air drying, the DNA was resuspended in 3
µl of water. Thirty-seven microliters of
electroporation-competent DH10B cells (BRL) was added to
the DNA and electroporation was completed using a BioRad
electroporation unit. After electroporation, 4 ml of SOC
(Maniatis et al.) was added to the cells, and 400 µl was
spread on each of ten 150 mm LB ampicillin plates. Each
plate represented a sublibrary of 100,000 clones. After
an overnight incubation, the cells were harvested by
adding 10 ml of LB ampicillin media to each plate and
scraping the cells into the media. Glycerol stocks and
plasmid DNA were prepared from each plate. The library
background (vector without insert) was established at
about 15%.

Detection of GluGR activity from the cDNA library
The Xenopus oocyte efficiently translates
exogenously added mRNA. Preliminary experiments were
done using the mouse m1 muscarinic receptor cDNA (a G
protein-coupled receptor that can be detected by
voltage-clamp) cloned into pVEGT'. Injection of RNA
transcribed in vitro from increasing dilutions of the m1
template DNA indicated that m1 agonist-induced activity
could be detected for one clone in a pool size of
100,000. A cerebellum sublibrary was plated into ten
pools of 100,000 unique clones.
The pools could also be replica plated onto a
nitrocellulose filter and the original and replica
allowed to grow for a few hours. The original plate is
scraped to harvest all the colonies. Plasmid DNA is
prepared and purified by cesium chloride gradient
ultracentrifugation. The DNA from each pool is
transcribed in vitro with T7 RNA polymerase in the
presence of 7-methyl-G, the capped nucleotide, to
increase translation efficiency. Template DNA
transcription reactions are spiked with a dilution of two
control genes cloned into pVEGT': the mouse m1 gene and a
secreted version of the human placental alkaline
phosphatase gene (SEAP; Tate et al., Fed. Am. Soc. Exp.
Biol. 8:227-231 (1990), incorporated by reference
herein). Transcription from the control genes allows
selection of oocytes that more efficiently
translate the injected RNA, and a determination whether
oocytes that are negative for the GluGR are true
negatives, that is, still have a detectable m1
agonist-induced response.

Plasmid DNA prepared from each of the 10 pools of
100,000 clones, which in total represented one sublibrary
of one million clones of the cerebellum cDNA library, was
purified by cesium chloride gradient ultracentrifugation.
The DNA was transcribed in vitro with T7 RNA polymerase
(Pharmacia) in the presence of capped nucleotide (GpppG,
Pharmacia). The presence of a poly(A) sequence and two
T7 RNA polymerase terminators in pVEGT' resulted in RNA
with a capped 5' end, the sequence of the cDNA insert,
and 3' poly(A) tails. Capped RNA is believed necessary
for efficient translation in oocytes (Noma et al., Nature
319:640 (1986)) and the poly(A) sequence has been shown
to increase the synthesis of a protein in oocytes by more
than 40-fold. The transcription reaction tubes were set
up by adding 12 µl of 5x transcription buffer (Stratagene
Cloning Systems, La Jolla, CA), 3 µl each of 10 mM ATP,
CTP, GTP, and UTP, 6 µl of 10 mM GpppG (Pharmacia), 6 µl
of 1 mg/ml BSA, 3 µl of 200 mM DTT, 1.5 µl of 40 U/µl
RNasin (ProMega Biotech, Madison, WI), 8.5 µl of water,
10 µl of cDNA containing 5 to 10 µg DNA, and 1 µl of 70
U/µl T7 RNA polymerase. After mixing, 10 µl of the
reaction was transferred to a tube containing 0.5 µCi of
α-32P-UTP to determine the total counts and counts
incorporated into RNA. The samples were incubated at
37°C for one hour. The cDNA in the unlabeled samples was
degraded with the addition of 1 µl of 200 mM DTT, 2 µl of
30 U/µl DNase I, and 0.5 µl of 40 U/µl RNasin and the
incubation was continued at 37°C for 15 minutes. Forty
microliters of water was added to the radiolabeled
reactions, and 1 µl was removed from each sample and
counted to determine total counts. The remainder of the
labeled samples were ethanol precipitated. The samples
were centrifuged to collect the RNA and the RNA pellets
were counted to determine the counts incorporated into
RNA. After the DNA degradation reaction in the unlabeled
samples, 70 µl of 10 mM Tris-HCl, 1 mM EDTA was added to
each sample, and the samples were twice extracted with
phenol-chloroform followed by one chloroform extraction.
The RNA was ethanol precipitated. After centrifugation
to collect the RNA, the pellets were washed with 80%
ethanol, followed by air drying for 10 minutes. A
typical yield of the unlabeled RNA was 20 to 30 µg. The
unlabeled RNA was resuspended at 2 µg/µl in
diethylpyrocarbonate (DEPC, Sigma) treated water and
stored at -70°C.
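As a cross-check on the transcription reaction described above, the component volumes can be tallied to confirm the total reaction volume and the final concentration of each stock. The following is a minimal sketch; the dictionary layout and function names are illustrative assumptions, and only the volumes and stock concentrations come from the text.

```python
# Component volumes (microliters) for one in vitro transcription reaction,
# as listed in the text. The data structure itself is an illustrative choice.
COMPONENTS_UL = {
    "5x transcription buffer": 12.0,
    "10 mM ATP": 3.0,
    "10 mM CTP": 3.0,
    "10 mM GTP": 3.0,
    "10 mM UTP": 3.0,
    "10 mM GpppG": 6.0,
    "1 mg/ml BSA": 6.0,
    "200 mM DTT": 3.0,
    "40 U/ul RNasin": 1.5,
    "water": 8.5,
    "template cDNA": 10.0,
    "70 U/ul T7 RNA polymerase": 1.0,
}


def total_volume_ul(components):
    """Sum of all component volumes in microliters."""
    return sum(components.values())


def final_concentration(stock, volume_ul, total_ul):
    """Dilute a stock concentration by the ratio volume / total volume."""
    return stock * volume_ul / total_ul


total = total_volume_ul(COMPONENTS_UL)
buffer_x = final_concentration(5.0, COMPONENTS_UL["5x transcription buffer"], total)
ntp_mm = final_concentration(10.0, COMPONENTS_UL["10 mM ATP"], total)
cap_mm = final_concentration(10.0, COMPONENTS_UL["10 mM GpppG"], total)

print(f"total volume: {total} ul")
print(f"buffer: {buffer_x}x, each NTP: {ntp_mm} mM, GpppG: {cap_mm} mM")
```

Under these figures the components sum to 60 µl, which brings the 5x buffer to 1x and leaves the cap analog at twice the concentration of each individual nucleotide.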

Prior to microinjection into oocytes, the RNA 
samples were thawed and centrifuged in a microfuge for 5 
minutes to remove any particles that might clog a 
microinjection pipet. After centrifugation, 80% of each 
sample was removed and split into two tubes. 

The RNA from each of the 10 sublibraries was
injected into oocytes as described above and translation
was allowed for four days. Expression of GluGR activity
was assessed by voltage-clamp assay as described above.
One of the 10 sublibraries, Z93-1.9, produced a signal
with administration of quisqualate to the oocyte.

Subdivision of the cDNA library pool to obtain pure GluGR
clone

The DNA pool (Z93-1.9) was subdivided by plating 
clones from the glycerol stock onto LB ampicillin plates. 
To determine the number of clones that should be plated 
for the subdivision of the 100,000 clone pool to identify 
a positive clone, the probability equation $N = \ln(1 - P) / \ln(1 - f)$
(Maniatis et al., ibid.) was used, where P
is the desired probability of including the clone of 
interest, f is the fraction of positive clones in the 
pool, and N is the number of clones to be plated to 
provide the given probability. For a probability of 
99.8% for a pool size of 100,000 to contain one positive 
clone, 621,461 clones should be plated. 
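The calculation above can be reproduced in a few lines. This is a minimal sketch; the function names are hypothetical and not part of the original protocol. It evaluates the equation exactly and also with the small-f approximation ln(1 - f) ≈ -f, which is the form that reproduces the quoted figure of 621,461.

```python
import math


def clones_to_plate(p_desired, pool_size):
    """Exact N = ln(1 - P) / ln(1 - f), with f = 1 / pool_size.

    Rounded up, since a fractional clone cannot be plated.
    """
    f = 1.0 / pool_size
    return math.ceil(math.log(1.0 - p_desired) / math.log(1.0 - f))


def clones_to_plate_approx(p_desired, pool_size):
    """Same equation using ln(1 - f) ~ -f, valid when f is very small."""
    return round(-math.log(1.0 - p_desired) * pool_size)


exact = clones_to_plate(0.998, 100_000)
approx = clones_to_plate_approx(0.998, 100_000)
print(exact, approx)
```

The exact and approximate results differ by only a few clones out of more than 600,000, so the choice between them has no practical effect on how many plates are needed.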

Forty-eight 150 mm LB ampicillin plates were
plated with the glycerol stock representing the 100,000
positive pool, Z93-1.9, at a density of approximately
14,000 clones per plate to give a total of 670,000
clones. After an overnight incubation at 37°C, the bacteria
on each plate were harvested into 10 ml of Solution I (as
described by Birnboim and Doly, Nuc. Acids Res. 7:1513
(1979), incorporated by reference herein). A glycerol
stock was prepared from a portion of the cells, and
plasmid DNA was prepared from the remainder of the cells.
Six pools of DNA, each representing eight of the LB
ampicillin plates, were prepared by combining one tenth of
the plasmid DNA from groups of eight plates into each pool.
The plasmid DNA from these six pools was purified by
cesium chloride gradient centrifugation. The DNA was
transcribed into RNA as outlined above. Transcription of
the parent pool Z95-1.9 was included as the positive
control. Oocytes were injected with the RNA and
voltage-clamp assays on the oocytes identified pool
Z99-25-32 as positive for GluGR. Pool Z99-25-32 contained
DNA prepared from plates 25 through 32.
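The grouping of plates into pools described above follows a simple block assignment. The sketch below illustrates it; the constant and function names are hypothetical, and only the plate count, pool size, and plating density come from the text.

```python
PLATES = 48              # 150 mm plates in this round of subdivision
PLATES_PER_POOL = 8      # plates combined into each DNA pool
CLONES_PER_PLATE = 14_000


def pool_for_plate(plate):
    """Return the (first, last) plate numbers of the pool containing `plate`."""
    if not 1 <= plate <= PLATES:
        raise ValueError(f"plate must be between 1 and {PLATES}")
    first = ((plate - 1) // PLATES_PER_POOL) * PLATES_PER_POOL + 1
    return first, first + PLATES_PER_POOL - 1


# All distinct pools, in plate order.
pools = sorted({pool_for_plate(p) for p in range(1, PLATES + 1)})
total_clones = PLATES * CLONES_PER_PLATE

print(pools)
print(total_clones)
```

A positive pool such as the one covering plates 25 through 32 is then resolved by assaying each of its eight plates individually, which is the next step in the text.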

Plasmid DNA from plates 25 to 32 was cesium
chloride banded and transcribed into RNA as described
above along with the positive parent pool Z99-25-32.
Oocytes were injected with the RNA and voltage-clamp
assays, carried out as described above, identified pools
Z104-25 and Z111-32 as being weakly positive, Z106-27 and
Z109-30 as intermediately positive, and Z108-29 and
Z110-31 as the most positive. The pool resulting in
Z110-31 was chosen for further subdivision.

Identification of positive pools from the
subdivision of the positive pool of 14,000 (Z110-31) from
the glycerol stock was unsuccessful. Therefore, plasmid
DNA prepared from the pool resulting in Z110-31 was
electroporated into bacteria and plated on 60 plates at a
density of 1,000 clones/plate. Plasmid DNA was prepared
from the bacteria harvested from each plate. Aliquots of
the plasmid DNA from each plate were mixed to make six
pools representing ten plates each. The plasmid DNA was
cesium chloride banded, and the RNA was transcribed as
described above. RNA was transcribed from pools Z108-29,
Z110-31, and a muscarinic receptor cDNA, m1, for use as
positive controls. The RNA was injected into oocytes and
voltage-clamp assays were carried out as described above.
The assays identified pool Z133-21 to 30 as positive.

Plasmid DNA from plates 21 to 30 was cesium
chloride banded and transcribed as described above. The
transcribed RNA and the RNA from the parent pool Z133-21
to 30 were injected into oocytes and assayed as described
above. The voltage-clamp assay identified pool Z142-22
as positive.

Identification of positive pools by the
subdivision of the positive pool Z142-22 from a glycerol
stock proved unsuccessful. Restriction analysis of
plasmid DNA prepared from randomly selected clones from
pools Z110-31 (the pool of 14,000) and Z142-22 (the pool
of 1,000) indicated that 50% of pool Z110-31 and 68% of
pool Z142-22 were clones without inserts.

To assess physical methods for enriching for the
GluGR clone and to establish how many clones from pool
Z142-22 needed to be assayed to include a GluGR clone,
undigested plasmid DNA from pool Z142-22 was
electrophoresed on an agarose gel. The super-coil band
representing vector without insert was cut out and the
remainder of the DNA was eluted from the gel. The DNA
was then electroporated into bacterial cells, and plated
at densities of 3,400, 6,900, and 13,800 clones per
plate. The plates were replica plated and grown
overnight. Plasmid DNA was prepared from the cells
harvested from the replica of each plate. The plasmid
DNA was transcribed, and the RNA was assayed in oocytes
as described above. As a control, each pool contained
the equivalent of one colony of m1 as an internal
positive control. In addition, m1 was used as an
external positive control. The voltage-clamp assays
identified the DNA from the 6,900 clone pool (Z167-7) as
positive.

The clones represented on the 6,900 clone plate
that resulted in the positive pool Z167-7 were subdivided
by replica plating the master plate onto a Biodyne-A
nylon membrane on an LB ampicillin plate. The replica
plate was incubated four hours at 37°C. After
incubation, sub-pools were prepared by removing the
membrane from the plate, taping the membrane to a sterile
glass plate on a light box, and overlaying the membrane
with a grid which divided the membrane into 100 sections.
The sections of the grid and underlying membrane were
then cut out with a razor blade that had been dipped in
alcohol and flamed between each cut. Alcohol-treated,
flamed forceps were used to transfer each membrane
section to a test tube containing 12.5 ml of LB
ampicillin media. The cultures containing the membrane
sections were incubated overnight at 37°C. After
incubation, 0.5 ml of each culture was mixed with 0.5 ml
of 50% glycerol and stored at -70°C to establish glycerol
stocks of each sub-pool. Aliquots of the 100 cultures
were pooled in a 10 x 10 matrix with samples (1) through
(10) on the abscissa and samples (a) through (j) on the
ordinate. For example, 1 ml of cultures (1) through (10)
were added to tube 1 and 1 ml of cultures (1), (11),
(21), (31), (41), (51), (61), (71), (81), and (91) were
added to tube (a) and so on until 10 rows of 10 and 10
columns containing pools of 10 cultures each were
completed. Ten microliters of an overnight culture
containing m1-transformed bacteria was added to each pool
as an internal control. Plasmid DNA was prepared from
the 20 sub-pools, and the DNA was purified by cesium
chloride gradient centrifugation. RNA was transcribed
from the plasmid DNA and was assayed in oocytes as
described above. Positive controls were the parent pool
Z167-7 and pure m1 RNA. The voltage-clamp assays
indicated that only pools Z175-1 and Z191-g were
positive. Consulting the matrix, this indicated that the
membrane section number (7) contained the GluGR clone.
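The matrix pooling scheme described above lets 100 cultures be resolved with only 20 assays, because each culture sits in exactly one numbered pool and one lettered pool. The following is a minimal sketch of the bookkeeping; the function names and the row/column terminology are assumptions made for illustration, while the pool composition (tube 1 holding cultures 1 through 10, tube (a) holding cultures 1, 11, ..., 91) follows the text.

```python
import string

GRID = 10  # 10 x 10 matrix of cultures numbered 1..100


def build_pools(grid=GRID):
    """Return (numbered_pools, lettered_pools) mapping pool label -> cultures."""
    numbered = {
        r: [(r - 1) * grid + c for c in range(1, grid + 1)]
        for r in range(1, grid + 1)
    }
    lettered = {
        string.ascii_lowercase[c - 1]: [(r - 1) * grid + c for r in range(1, grid + 1)]
        for c in range(1, grid + 1)
    }
    return numbered, lettered


def locate(numbered_pool, lettered_pool, grid=GRID):
    """Identify the single culture shared by one positive pool of each kind."""
    numbered, lettered = build_pools(grid)
    shared = set(numbered[numbered_pool]) & set(lettered[lettered_pool])
    (culture,) = shared  # exactly one culture lies at the intersection
    return culture


print(locate(1, "g"))
```

With pool 1 and pool (g) positive, the intersection is culture 7, matching the membrane section identified in the text. The same decoding applies to any later 10 x 10 matrix of individual clones.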

To subdivide the clones contained in section (7),
a piece of Biodyne A membrane was applied to the master
plate containing section (7), the membrane extending
beyond section (7) on each side by half the width of
section (7). The membrane was removed from the plate,
applied to a fresh LB ampicillin plate colony side up,
and incubated overnight at 37°C. The membrane was
subdivided as described above with the central region of
the membrane, the actual section (7) area, divided into 9
small, equivalent-sized squares and the membrane on each
side of section (7) taken as four additional areas.
Each membrane section was used to inoculate a 10 ml
liquid culture. Bacteria transformed with the m1 clone
were used as an internal control in each culture as
described above. After overnight incubation at 37°C,
plasmid DNA was prepared, and the DNA was purified by
cesium chloride gradient centrifugation. RNA was
transcribed and assayed in oocytes as described above
using RNA from m1 and the parent pool number (7) as
positive controls. GluGR activity was found in only pool
Z203-7 corresponding to membrane section number (7).

Pool Z203-7 was subdivided by electroporating the
plasmid DNA prepared from the membrane section number (7)
into DH10B electroporation-competent cells. The
transformants were plated at a density enabling
individual colonies to be picked. Individual clones
were picked to a master plate and into 2 ml of LB
ampicillin media. The cultures were incubated overnight,
and plasmid DNA was prepared by the method essentially
described by Holmes and Quigley (Anal. Biochem. 114:193
(1981)). Restriction analysis suggested that the clones
were grouped into 7 different classes of clones. Plasmid
DNA prepared from each class, representing fifty total
clones, was transcribed and assayed in oocytes
as described above. However, none of the clones were
positive.

To screen for positive clones,
electroporation-competent E. coli DH10B cells were
electroporated with
the DNA prepared from membrane section number (7)
(Z203-7) and were plated at 180, 360, 900, and 1800
colonies per plate. The plates were incubated overnight,
and replica plates were prepared as described above.
Plasmid DNA prepared from each replica plate was combined
with 1 to 1000 parts of m1 as an internal control. The
DNA pools, the m1 clone and the parent pool Z203-7 were
transcribed, and the RNA was assayed by oocyte injection.
The first transcription and injection showed no
positives; however, upon retranscription and reanalysis
the 1800 clone pool (Z264-1800) was positive for GluGR.

To subdivide the positive pool of 1800
(Z264-1800), all of the colonies from the plate of 1800,
1528 in total, were each picked to two 100 mm LB
ampicillin agar plates on a 100 colony grid. After
overnight growth, one set of the duplicate plates was
designated as a master set and was placed at 4°C. The
other set was replica plated to a third set of plates.
After overnight incubation of these plates, the cells on
the replica plates were harvested into media and plasmid
DNA was prepared from the pooled cells. As described
above, an internal m1 control was included in each DNA
preparation. m1 DNA and the parent Z264-1800 DNA were
used as external positive controls. Plasmid DNA prepared
from the 16 plates was transcribed, and the RNA was
assayed in oocytes as described above. One of the pools
of 100 clones, Z256-I, produced GluGR activity.

To identify which clone of the 100 clones from
Z256-I produced the GluGR activity, a 10 x 10 matrix of
the clones was constructed. A liquid culture of each
clone was grown. One milliliter of each culture was
added to each of two tubes representing the appropriate
row and column of the 10 x 10 matrix. As described
previously, plasmid DNA encoding m1 was used as an
internal positive control. Plasmid DNA prepared from
each tube, m1 DNA and DNA from the parent pool Z264-1800
were transcribed and assayed in oocytes as described
above. GluGR activity was identified only in row (5) and
column (e). Thus, the positive clone number 45 was
identified as containing the GluGR activity.

To confirm the result, plasmid DNA from clone #45
was prepared, transcribed and assayed in oocytes as
described above. The results of the assay indicated that
clone #45 was capable of producing GluGR activity. Figure
2 illustrates the data taken from voltage-clamp
recordings at several stages in the subfractionation of
the cerebellum library. Panel (a) is a recorded response
to quisqualate of an oocyte previously injected with in
vitro transcribed RNA from a rat cerebellum sublibrary of
100,000 independent colonies; panel (b) shows the
response to quisqualate in a cell previously injected
with RNA transcribed from a subfractionated pool of
14,000 colonies. The peak current was truncated by the
chart recorder, but the actual peak current (estimated
from a digital panel meter) was approximately 1300 nA.
Panel (c) shows the response to quisqualate in a cell
injected with pure GluGR RNA from clone 45-A. The amount
of RNA injected per oocyte was approximately 100 ng,
except in panel (c) where the amount of RNA was 50 pg.

The following describes an alternative means for
subdividing and screening a positive pool. Working with
cDNA inserts in a plasmid-based rather than a lambda-based
vector influences the subfractionation protocols.
Once a positive pool is identified, the replica filter is
overlaid with another sterile nitrocellulose filter.
The filter is cut into 88 pieces by using evenly spaced
cuts of 10 rows and 10 columns to form a grid. Each of
the 88 pieces is transferred to 10 ml of sterile LB + Amp
and grown for several hours. Twenty pools are formed: C
1-10 (corresponding to column number) and R 1-10
(corresponding to row number). An aliquot of each of the
88 subfractions is pipetted into 2 tubes, corresponding
to its position in a row and a column. DNA is isolated
from the 20 pools, purified on CsCl gradients and
transcribed in an in vitro reaction that includes the
control m1 and SEAP plasmids. After injection into
oocytes and voltage-clamp recording there are 2 positive
pools, pinpointing the location of 1 of the 88 original
subfractions.

Because the positive clone is still part of a pool
it must be further subdivided. The probability equation
described above is used to determine the number of clones
to be plated for the next subdivision of the pool. The
glycerol stock from the positive pool is plated out at,
e.g., 13000, 6000 and 18,000 clones per plate. After
replica plating the DNA is harvested, transcribed,
injected and assayed. The pool which is positive is
subdivided into a grid of 88 as described above. The
assay is repeated, and a single square of the grid is
positive. At the next step of subdivision of the pool,
100 individual colonies to a plate are picked, replica
plated, and 20 pools are made for transcription and
assay. Positive clones are streaked out, several
colonies are picked and restriction mapped, and template
and transcript are prepared for injection and assay.

Characterization of GluGR

To establish that the GluGR encoded by clone 45-A
couples to G-protein, clone 45-A GluGR RNA was transcribed
and injected into oocytes as described above. Two days
after injection the oocytes were divided into control and
toxin-treated groups. The oocytes in the toxin-treated
group were treated with a final concentration of 4 µg/ml
of B. pertussis toxin (List Biological Laboratories Inc.,
Campbell, CA), and both groups were incubated for 24
hours at 19°C as described by Sugiyama et al., Nature
325:531 (1987) and Moriarity et al., J. Biol. Chem.
264:13521 (1989), both of which are incorporated by
reference herein. The oocytes from both the control and
toxin-treated groups were subjected to voltage-clamp
assays as described previously. In one example, oocytes
perfused as described previously with 100 µM L-glutamic
acid showed a mean L-glutamic acid-induced current of
264.2 nA +/- 73 nA in control oocytes (SEM, n=6) and 57.7
nA +/- 19 nA (n=9) in toxin-treated oocytes. The mean
membrane current in the toxin-treated group was
significantly smaller (p < 0.01) than in the control
group, suggesting that the receptor expressed in oocytes
injected with 45-A RNA couples to a pertussis
toxin-sensitive G protein.

L-glutamic acid and some of its structural
derivatives that are known to activate GluGR currents in a
dose-dependent manner were applied to oocytes that had
been injected with RNA transcribed from the 45-A clone.
RNA was transcribed and oocytes were prepared and
injected as previously described. Dose-dependent
responses were measured using voltage-clamp assays
carried out in the presence of increasing concentrations
of L-glutamic acid (Sigma), quisqualic acid (Sigma),
ibotenic acid (Sigma), or trans-1-amino-cyclopentyl-1,3-
dicarboxylic acid (tACPD; Tocris Neuramin, Essex,
England). Four or five separate oocytes were perfused
with increasing concentrations of a particular drug with
30 minutes between consecutive applications of the drug
to minimize any interference from desensitization. The
responses were normalized to a subsequent response to 100
µM L-glutamic acid. The data were analyzed using the
following equation:

$$\text{Fractional current} = \frac{\text{Dose}^{n}}{\text{Dose}^{n} + (\mathrm{EC}_{50})^{n}},$$

where:

Dose = a dose of drug normalized to that evoked by
a subsequent application of 100 µM L-glutamic acid;

Fractional current = the peak current evoked by a
dose, as defined above;

EC50 = effective concentration that evokes a 50%
response (a measure of the potency of an agonist); and

n = the Hill coefficient, a measure of the
cooperativity of the reaction.
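The dose-response relationship defined above can be evaluated numerically as follows. This is a minimal sketch: the function name is hypothetical, the Hill coefficient of 1.0 is an assumed placeholder because no fitted value is stated in the text, and the EC50 used is the value reported for L-glutamic acid in Table 2.

```python
def fractional_current(dose_um, ec50_um, hill_n):
    """Hill equation: Dose^n / (Dose^n + EC50^n).

    dose_um and ec50_um must be in the same units (micromolar here).
    """
    if dose_um < 0 or ec50_um <= 0:
        raise ValueError("dose must be non-negative and EC50 positive")
    return dose_um**hill_n / (dose_um**hill_n + ec50_um**hill_n)


EC50_GLUTAMATE_UM = 12.32   # L-glutamic acid, from Table 2
ASSUMED_HILL_N = 1.0        # placeholder only; not a value given in the text

for dose in (1.0, 12.32, 100.0):
    value = fractional_current(dose, EC50_GLUTAMATE_UM, ASSUMED_HILL_N)
    print(f"{dose:7.2f} uM -> fractional current {value:.3f}")
```

By construction the fractional current is 0.5 whenever the dose equals the EC50, regardless of the Hill coefficient, which is why the EC50 serves as a measure of potency that is independent of cooperativity.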

Using this equation, the effective concentration
at 50% stimulation relative to 100 µM L-glutamic acid was
determined for each dose response experiment. Figure 6
shows a representative dose response curve for varying
concentrations of L-glutamic acid. The potency series of
glutamate analogs and their associated EC50's are listed
in Table 2.

Table 2

Glutamate Analog Potencies

Analog              EC50
Quisqualic acid     0.681 µM
L-glutamic acid     12.32 µM
Ibotenic acid       32.37 µM
tACPD               376 µM

In addition, oocytes were exposed to the following
L-glutamic acid analogs: aspartic acid (Tocris Neuramin),
kainic acid, N-methyl-D-aspartic acid (NMDA; Sigma),
2-amino-4-phosphonobutyric acid (APB; Sigma), and
α-amino-3-hydroxy-5-methyl-isoxazole-4-propionic acid
(AMPA; Research Biochemicals Inc., Wayland, MA) at
saturating concentrations, and the responses were each
normalized to a subsequent response to 100 µM
L-glutamate. The L-glutamic acid analogs that were found
to be ineffective were 1 mM aspartic acid, 1 mM kainic
acid, 100 µM NMDA + 10 µM glycine, 100 µM APB and 100 µM
AMPA.

Voltage-clamp assays were also carried out on
injected oocytes to measure the inhibition by the
putative glutamate G protein-coupled receptor antagonist,
2-amino-3-phosphonopropionic acid (AP3). Voltage-clamp
assays showed that at 1 mM, DL-AP3 (Sigma) reduced the
current evoked by 10 µM glutamic acid to 59.3 +/- 7.3% of
the control.

Clone 45 cells were streaked out on LB Amp plates
and several colonies were picked and grown up, and the DNA
was isolated. Pure 45-A DNA was prepared and restriction
mapped by standard procedures. Clone 45-A has been
deposited with the American Type Culture Collection,
12301 Parklawn Drive, Rockville, MD, 20852, under ATCC
Accession No. 68497. DNA was digested with single or
multiple enzymes. The fragments were separated on both
1% agarose and 4% Nusieve gels by electrophoresis. After
electrophoresis the DNA was transferred to nitrocellulose
filters using standard protocols for Southern transfer.
Restriction sites were mapped based on size and based on
hybridization to Pst I subclones of 45-A DNA.
Additionally, the entire 45-A cDNA insert can be isolated
by digestion with Not I restriction endonuclease. The
Not I insert was kinased with γ-32P ATP, and after
digestion of half of the samples with Bam HI to remove the
3' label, both samples were subjected to digestion with a
number of enzymes known to be present once in the insert.
In this way the unique sites could be localized. A
restriction map of GluGR clone 45-A is shown in Figure 3.

The entire 45-A clone was sequenced in both
directions using the dideoxynucleotide chain termination
method (Sanger and Coulson, J. Mol. Biol. 94:441 (1975),
incorporated herein by reference). Figure 5 (Sequence ID
Nos. 1 and 2) shows the DNA sequence and deduced amino
acid sequence of clone 45-A. Figure 5 also shows the
location of putative N-linked glycosylation sites, which
have been predicted to occur at the amino acid sequence
Asn-X-Thr.

As shown in Figure 5, seven putative transmembrane
domains have been predicted from the deduced amino acid
sequence of clone 45-A using the method described by
Eisenberg et al., J. Mol. Biol. 179:125-142 (1984),
incorporated herein by reference. Only those predicted
to be transmembrane multimeric domains were included. An
additional transmembrane domain (the third) was predicted
using the method of Hopp and Woods, Proc. Natl. Acad.
Sci. USA 78:3824-3838 (1981). Based on these
predictions, the protein encoded by clone 45-A appears to
have two unusually large domains on the amino- and
carboxy-termini that are not found in any of the other
reported G protein-coupled receptors which have the
common structural feature of seven predicted membrane
spanning regions. Analysis of the deduced amino acid
sequence of clone 45-A predicts three other hydrophobic
stretches including one at the amino-terminus of the
sequence. This amino-terminal hydrophobic stretch may be
a signal sequence, although no signal cleavage site is
predicted downstream of the sequence.

Poly(A)+ RNA was isolated from total rat brain and
rat cerebellum using oligo d(T) cellulose chromatography
as described by Aviv and Leder (ibid.). Poly(A)+ RNA
from rat retina, rat heart, rat lung, rat liver, rat
kidney, rat spleen, rat testis, rat ovary and rat
pancreas was purchased from Clontech. The poly(A)+ RNA
samples were analyzed by northern analysis (Thomas, Proc.
Natl. Acad. Sci. USA 77:5201-5205 (1980), which is
incorporated by reference herein). The RNA was denatured
in glyoxal, electrophoresed in agarose and transferred to
a nitrocellulose membrane essentially as described by
Thomas (ibid.). The northern blot was hybridized with a
radiolabeled 3473 bp Eco RI-Xba I fragment from the 45-A
clone. Autoradiography of the blot showed hybridization
to a major band of approximately 7 kb and a smaller band
of approximately 3.8 kb in the total rat brain and rat
cerebellum RNA.

Single-stranded cDNA was synthesized from 1 µg of
the poly(A)+ RNA using Superscript reverse transcriptase
(BRL) under conditions described by the manufacturer.
One fourth of the cDNA was used as a template for PCR
amplification using 40 pmoles each of the GluGR-specific
primers ZC3652 (Table 1; Sequence ID Number 14) and
ZC3654 (Table 1; Sequence ID Number 15) and 2.5 U Taq I
polymerase (Perkin Elmer Cetus, Norwalk, CT) and
conditions specified by the manufacturer. As an internal
control, the PCR reaction also contained 2 pmoles each of
the glucose-6-phosphate dehydrogenase-specific primers
ZC3015 (Table 1; Sequence ID Number 12) and ZC3016 (Table
1; Sequence ID Number 13). After thirty cycles (one
minute at 94°C, one minute at 60°C, ninety seconds at
72°C), the samples were phenol-chloroform extracted and
20% of each reaction was electrophoresed in agarose. The
DNA was bidirectionally transferred to nitrocellulose
membranes, and the filters were hybridized with either
radiolabeled ZC3652, ZC3654, ZC3015 and ZC3016 (Sequence
ID Nos. 14, 15, 12 and 13, respectively) or with the
radiolabeled Eco RI-Xba I fragment of clone 45-A
described above. Autoradiography of the hybridized blot
showed that GluGR transcript was mainly confined to total
rat brain and rat cerebellum; however, longer exposures
showed a GluGR-specific transcript in both retina and
testis.

Total RNA was prepared, as described above, from
specific rat brain regions including frontal cortex,
cerebellum, hippocampus, cortex, striatum, pons medulla,
and the remainder of the brain. Single-stranded cDNA was
synthesized as described previously using 20 µg of total
RNA in 50 µl using Superscript reverse transcriptase
(BRL) under conditions described by the manufacturer.
After a one hour incubation at 42°C, the samples were
treated with RNAse (Boehringer Mannheim Biochemicals,
Indianapolis, IN), phenol-chloroform extracted, and
ethanol precipitated. The samples were resuspended in
water and half of each sample was subjected to PCR
amplification. Each PCR amplification contained 40
pmoles of each of the GluGR-specific primers ZC3652 and
ZC3654 described above (Sequence ID Numbers 14 and 15), 2
pmoles of each of the glucose-6-phosphate
dehydrogenase-specific primers ZC3015 and ZC3016
(Sequence ID Nos. 12 and 13) and 2.5 U Taq I polymerase
(Perkin Elmer Cetus) and conditions described by the
manufacturer. After 35 cycles (one minute at 94°C, one
minute at 60°C, ninety seconds at 72°C), the samples were
phenol-chloroform extracted, and 20% of each reaction was
electrophoresed in agarose. The DNA was transferred to a
nitrocellulose membrane, and the filter was hybridized
with the radiolabeled Eco RI-Xba I fragment of clone 45-A
described above. Autoradiography of the hybridized blots
showed a broad distribution of the GluGR transcript
throughout the brain, although the frontal cortex and
cerebellum appear to be somewhat enriched.

Southern analysis of rat and human genomic DNA was
carried out using essentially the method described by
Blin et al. (Nuc. Acids Res. 3: 2303 (1976), which is
incorporated by reference herein). Briefly, rat and
human genomic DNA was prepared from the rat cell line UMR
106 (ATCC CRL 1661) and a human hepatoma cell line (ATCC
HTB 52), respectively. The genomic DNA was digested with
either Eco RI or Pst I, and electrophoresed through
agarose. The DNA was transferred to a nitrocellulose
membrane, and the membrane was hybridized with a
radiolabeled 1.6 kb Pst I fragment from clone 45-A.
Autoradiography of the hybridized blot suggested that the
human gene has a sequence similar to the rat GluGR
sequence, that the GluGR gene contains at least one
intron, and that there are a small number of closely
related genes.

Expression in Mammalian Cells

The entire GluGR cDNA insert was removed from the
pVEGT' cloning vector by digestion with Not I and Xba I.
The ends were blunted with DNA polymerase I (Klenow
fragment) and dNTPs, and were then ligated with Eco RI
(Smart) linkers. After linker ligation, the insert with
Eco RI ends was kinased and ligated to Eco RI-cut and
capped Zem228 expression vector. Bacteria were
transformed with the ligation reaction and clones were
characterized by restriction analysis and partial
sequencing (see Fig. 4).

Cultured mammalian cells, such as BHK 570 and BHK
ts13, served as host cells for expression. Twenty-five µg
of CsCl-purified DNA was precipitated with calcium
phosphate and added to tissue culture cells in a 150 mm
plate. After 4 hours the cells were subjected to a
glycerol shock and were then put into non-selective
medium. In some cases it may be necessary to include an
antagonist to the GluGR in the medium to prevent
expression of a cytotoxic response in those cells where
the GluGR is expressed at levels high enough to cause a
certain amount of autoactivation. To assay transiently
expressed GluGR ligand binding activity or PLC
activation, cells are harvested after 48 hours. Stable
expression was detected after 2 weeks of selection. The
Zem228 expression vector includes a promoter capable of
directing the transcription of the GluGR gene, and a
selectable marker, the bacterial neomycin resistance
gene. Resistance to the drug G-418, an inhibitor of
protein synthesis, was used to identify stably
transfected clones. Presence of the SV40 ori region on
the vector allows the expression construct to also be
used for transient expression. In some instances it was
preferable to include DNA for another selectable marker,
the DHFR gene, in the transfection protocol. Selection
with both G-418 and methotrexate allowed isolation of
clones whose expression of GluGR can be subsequently
amplified by the addition of increasingly higher
concentrations of methotrexate to the culture medium.

Transfected cell lines expressing GluGR were
identified by the binding of 3H-glutamate to membrane
preparations from transfected cells. Cell lines
expressing low to moderate levels of GluGR are used to
set up functional screening assays.

Clones of BHK 570 and BHK TK- ts13 cells expressing
the rat G protein-coupled glutamate receptor cDNA were
plated in two or three 150 mm maxi-plate culture dishes
and were grown to confluency. The cells from each plate
were scraped in 5 ml of PBS (phosphate buffered saline,
Sigma Chemical Co., St. Louis, MO), which was
pre-chilled to 4°C. The cells were removed to a chilled
centrifuge tube, and the plates were each rinsed with 5
ml of chilled PBS and pooled with the cells. The chilled
tubes were spun at 1,000 rpm for two minutes, and the
supernatant was discarded. The cells were frozen at
either -70°C or on dry ice. In some cases, the cells
were left overnight at -70°C. The cells were thawed on
ice and were resuspended in 10 ml of a buffer containing
30 mM Tris, pH 7.0, 2.5 mM CaCl2, 1 mM PMSF, which was
pre-chilled to 4°C, by homogenizing the cells for about
15 seconds. The suspension was poured into chilled
centrifuge tubes. The homogenizer was rinsed with 10 ml
of the same chilled solution, and the rinse was combined
with the suspension. The centrifuge tubes were spun for
fifteen minutes at 40,000 x g at 4°C, and the supernatant
was discarded. The pellet was homogenized with a buffer
containing 30 mM Tris, pH 7.0, 2.5 mM CaCl2, which was
pre-chilled to 4°C. The homogenizer was rinsed with the
chilled buffer, and the rinse was combined with the
homogenate. The homogenate was spun as described above.
The second homogenization was repeated on the resulting
pellet. The final pellet was resuspended in between two
and five milliliters of 30 mM Tris, pH 7.0, 2.5 mM CaCl2,
which was pre-chilled to 4°C. Triplicate samples were
prepared for each plus and minus quisqualate assay point
such that 250 µl aliquots of each homogenate sample were
added to the wells of a 96-well microtiter plate.
Tritiated glutamic acid was added to a final
concentration of 10 nM to a buffer containing 30 mM Tris,
pH 7.0, 2.5 mM CaCl2, which was pre-chilled to 4°C, and
the solution was split in half. To one half, quisqualate
was added to a final concentration of 1 mM. Two hundred
and fifty microliter aliquots of either 30 mM Tris, pH
7.0, 2.5 mM CaCl2, 5 nM tritiated glutamic acid and 500
µM quisqualate, or 30 mM Tris, pH 7.0, 2.5 mM CaCl2, 5 nM
tritiated glutamic acid were added to the triplicate
samples. The samples were incubated for thirty minutes
at room temperature. The samples were harvested onto
glass filters and were immediately washed with ice-cold
30 mM Tris, pH 7.0, 2.5 mM CaCl2 under vacuum using an LKB
1295-001 automated cell harvester (Pharmacia LKB,
Piscataway, NJ). The filters were dried in a microwave
oven and counted in a gamma counter.
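
The arithmetic behind the plus and minus quisqualate assay points can be sketched as follows. This is an illustrative sketch only: the counts are hypothetical placeholders, not data from this specification, and the function names are introduced here for the example. It assumes that specific binding is taken as the mean of the minus-quisqualate triplicate less the mean of the plus-quisqualate triplicate, and that each 250 µl ligand aliquot is mixed with a 250 µl homogenate sample.

```python
from statistics import mean


def final_concentration(stock_conc, sample_volume_ul, added_volume_ul):
    """Concentration of a ligand after its solution is added to a sample.

    The result is in the same units as stock_conc.
    """
    return stock_conc * added_volume_ul / (sample_volume_ul + added_volume_ul)


def specific_binding(minus_competitor, plus_competitor):
    """Mean counts without competitor minus mean counts with competitor."""
    return mean(minus_competitor) - mean(plus_competitor)


# 10 nM tritiated glutamate mixed 1:1 with homogenate gives 5 nM in the well.
glutamate_nM = final_concentration(10.0, 250, 250)
# 1 mM (1000 uM) quisqualate mixed 1:1 gives 500 uM in the well.
quisqualate_uM = final_concentration(1000.0, 250, 250)

# Hypothetical triplicate counts, for illustration only.
minus_quisqualate = [1520.0, 1480.0, 1500.0]
plus_quisqualate = [310.0, 290.0, 300.0]
specific = specific_binding(minus_quisqualate, plus_quisqualate)

print(glutamate_nM, quisqualate_uM, specific)
```

Counts that remain in the presence of excess quisqualate are treated here as binding that quisqualate does not displace; a different definition of background would change only `specific_binding`.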

Protein determinations were carried out using a
Coomassie Blue-based assay from Pierce Chemical Company
(Rockford, IL) under conditions set forth by the
manufacturer. One hundred microliters of undiluted cell
homogenate or BSA standard was added to 2 ml of reagent
and the optical density was measured at 595 nm. Protein
concentrations of the samples were taken from a standard
curve generated using the BSA standards diluted in 30 mM
Tris, pH 7.0, 2.5 mM CaCl2.
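
Reading a sample concentration off a BSA standard curve can be sketched as below. The standards and absorbance values are hypothetical, and the straight-line fit is an assumption made for the example; the specification states only that concentrations were taken from a standard curve, and Coomassie-based assays are approximately linear only over a limited range.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def protein_from_absorbance(a595, slope, intercept):
    """Invert the standard curve to get a concentration from an A595 reading."""
    return (a595 - intercept) / slope


# Hypothetical BSA standards (mg/ml) and their A595 readings.
bsa_mg_per_ml = [0.0, 0.25, 0.5, 1.0]
bsa_a595 = [0.05, 0.15, 0.25, 0.45]

slope, intercept = fit_line(bsa_mg_per_ml, bsa_a595)
sample_mg_per_ml = protein_from_absorbance(0.33, slope, intercept)
print(round(sample_mg_per_ml, 3))
```

Readings that fall outside the range spanned by the standards should not be extrapolated with this fit.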

The results of these assays showed that
quisqualate was able to competitively bind the glutamate
receptor expressed by the transfected BHK cells.

Functional screening of agonists and antagonists

BHK 570 cells expressing GluGR or mock-transfected BHK
570 cells are plated into 24-well tissue culture dishes
at about 100,000 cells per well. After 24 hours, the
cells are labeled with 0.2 µCi of myo-(2-3H) inositol
(specific activity = 20 Ci/mmol; New England Nuclear)
per well. At the end of a 24 to 48 hour incubation, the
cells are washed with prewarmed DMEM (Dulbecco's Modified
Eagles Medium; Product No. 51-432, JRH Biosciences,
Lenexa, KS) which has been buffered to pH 7.4 with Hepes
buffer (Sigma Chemical Co.) containing 10 mM LiCl, and
are incubated for five minutes at 37°C. The selected drugs
are then added and the cells are incubated for an
additional thirty minutes at 37°C. The reaction is
stopped by placing the cells on ice, and the cells are
lysed by aspirating off the media and adding 0.5 ml of
cold DMEM and 0.5 ml of ice-cold 10% perchloric acid.
After ten minutes the cell lysate is transferred to a
tube on ice containing 250 µl 10 mM EDTA, pH 7.0. The
samples are neutralized with 325 µl of 1.5 M KOH in 60 mM
Hepes Buffer. After the precipitate settles, 1.0 ml of
the supernatant is applied to an Amprep minicolumn
(Amersham, Arlington Heights, IL, RPN1908). Inositol
phosphates are eluted off the column and samples are
counted in a scintillation counter. A positive response
is indicated by an increase in labeled inositol phosphate
levels.



Screening for additional glutamate receptor subtypes

Additional glutamate receptor subtypes were
isolated using probes derived from clone 45-A. Glutamate
receptor subtypes were isolated from a total rat brain
cDNA library in Lambda Zap II, which was size selected
for inserts of 3 kb before ligation (prepared for Terry
Snutch, Ph.D., University of British Columbia, Vancouver,
British Columbia, Canada by Stratagene Cloning Systems,
La Jolla, CA) and a rat cerebellum cDNA library in Lambda
Zap II, which was size selected for inserts of 3 kb
before ligation (Stratagene Cloning Systems, La Jolla,
CA).

The total rat brain library and the rat cerebellum
library were plated out with E. coli XL-1 cells onto NZY
agar plates (Table 3) to obtain approximately 2.1 x 10
plaques. Clone 45-A, encoding subtype 1a, was digested
with Pst I to isolate the 1.3 and 1.6 kb fragments. The
45-A Pst I fragments were labeled by random priming using
the Amersham random-priming kit (Amersham, Arlington Hts,
IL). Duplicate lifts were prepared from the plates, and
the filters were hybridized with the probes in 50%
formamide at 37°C. After an overnight hybridization, the
filters were washed in 2x SSC + 0.1% SDS at 50°C.
Positive plaques were isolated by several rounds of
dilution plating and repeated screening with the
random-primed probes.

Table 3

NZY Agar

To 950 ml of deionized water, add:

10 g NZ amine: Casein hydrolysate enzymatic (ICN
Biochemicals)
5 g NaCl
5 g bacto-yeast extract
1 g casamino acids
2 g MgSO4·7H2O

Shake until the solutes have dissolved. Adjust to
pH 7.0 with 5 N NaOH (approximately 0.2 ml). Adjust the
volume of the solution to 1 liter with deionized H2O.
Sterilize by autoclaving for 20 minutes.

20X SSC

Dissolve 175.3 g NaCl and 88.2 g sodium citrate in
800 ml H2O. Adjust the pH to 7.0 with a few drops of 10 N
NaOH. Adjust the volume to 1 liter with H2O. Sterilize
by autoclaving.
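
As a check on the 20X SSC recipe, the molar concentrations it implies can be worked out as in the sketch below. The recipe says only "sodium citrate"; the sketch assumes trisodium citrate dihydrate (molecular weight about 294.10 g/mol), which is an assumption of this example and not a statement in the recipe. The molecular weight of NaCl is taken as 58.44 g/mol.

```python
NACL_MW = 58.44                       # g/mol
SODIUM_CITRATE_DIHYDRATE_MW = 294.10  # g/mol, assumed salt form


def molarity(grams, molecular_weight, liters):
    """Moles of solute per liter of final solution."""
    return grams / molecular_weight / liters


def dilute(stock_molar, stock_strength, target_strength):
    """Concentration after diluting an 'NX' stock to a lower 'X' strength."""
    return stock_molar * target_strength / stock_strength


nacl_20x = molarity(175.3, NACL_MW, 1.0)
citrate_20x = molarity(88.2, SODIUM_CITRATE_DIHYDRATE_MW, 1.0)

# The 2x SSC wash is a tenfold dilution of the 20X stock.
nacl_2x = dilute(nacl_20x, 20, 2)
citrate_2x = dilute(citrate_20x, 20, 2)

print(round(nacl_20x, 2), round(citrate_20x, 2))
print(round(nacl_2x, 2), round(citrate_2x, 3))
```

Under these assumptions the stock is about 3 M NaCl and 0.3 M citrate, so a 2x SSC wash is about 0.3 M NaCl and 0.03 M citrate.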

Plasmid DNA was prepared from positive plaques
using the Bluescript system (Stratagene Cloning Systems).
The plasmid DNA was subjected to restriction analysis and
Southern blot analysis (Sambrook et al., ibid., which is
incorporated herein by reference). Two clones, SN23,
derived from the total rat brain library, and SR2,
derived from the rat cerebellum library, were identified
as being different from the 45-A clone and were
sequenced. Sequence analysis showed that they
represented two additional subtypes. SN23 encodes
subtype 1b, which contains an additional 85 bp exon that
encodes a new stretch of 20 amino acids and a stop codon
in the intracellular domain, and is 292 amino acids
shorter than the 45-A clone. The nucleotide sequence and
deduced amino acid sequence of clone SN23 are shown in
Fig. 7. SR2 was found to contain a partial cDNA sequence
encoding subtype 2a, which is a novel sequence that
shares 42% homology with the transmembrane domains and
extracellular domain of the 45-A clone.

A complete subtype 2a clone was obtained by
rescreening both libraries as described above with the
radiolabeled 1.3 kb Pst I fragment from clone 45-A and a
radiolabeled 1.4 kb Eco RI-Pvu II fragment from SR2. Two
additional clones were obtained. SN30, derived from the
total rat brain library, contained the entire subtype 2a
coding sequence. The nucleotide sequence and deduced
amino acid sequence of clone SN30 are shown in Fig. 8.
SR13, derived from the rat cerebellum library, contained
an incomplete sequence of a new receptor subtype, 2b.
Sequence analysis of SR13 showed that the coding sequence
was incomplete at the 3' end and was virtually identical
to the SN30 sequence except that it contained a 610 base
pair deletion within the 3' terminus of SN30. The DNA
sequence of the cDNA insert in clone SR13 is shown in
Figure 9.

The complete 3' end of the subtype 2b clone was
generated using PCR amplification and an oligonucleotide
containing a sequence unique to SR13 (ZC4520, Table 4)
and an oligonucleotide corresponding to a sequence near
the 3' end of the 3' non-translated region of SN30
(ZC4519, Table 4). DNA was prepared from plate lysates
of the original plating of each library. Each plate
produced a pool of clones. For the PCR reactions, ten
nanograms from each library and 100 pmol of each
oligonucleotide were combined in a reaction volume of 50
µl containing 50 mM KCl, 10 mM Tris-HCl pH 9.0, 1.5 mM
MgCl2, 0.1% Triton X-100, 0.01% gelatin, 0.2 mM each
deoxynucleotide triphosphate and 2.5 units of Thermus
aquaticus (Taq) DNA polymerase (Promega Corporation,
Madison, WI). The reaction mixture was overlaid with
mineral oil. After five cycles (30 seconds at 94°C, 30
seconds at 45°C, 1 minute at 50°C) and twenty-five cycles
(30 seconds at 94°C, 30 seconds at 45°C, 1 minute at
72°C) the amplified DNA was removed for analysis.



Table 4

Degenerate Oligonucleotide Primer Sequences (5' - 3')

ZC4519
TTT ATT AGA AAT GTT CTC GGT

ZC4520
CCT CTT CCA TAT TTT TCC ATT

ZC4559
ATA AGA ATT CAT NKR YTT NGC YTC RTT RAA

ZC4560
ATA AGA ATT CTT YRA YGA RAA NGG NGA YGC

ZC4561
ATA AGA ATT CGC NGG NAT HTT YYT NKG NTA

ZC4562
ATA AGA ATT CTA NCM NAR RAA DAT NCC NGC

ZC4563
ATA AGA AAT CAN GTN GTR TAC ATN GTR AA

An aliquot from each reaction was electrophoresed
on agarose and transferred to nitrocellulose for Southern
analysis. Southern analysis of the PCR products showed
that a 460 bp fragment corresponding to the 3' end of the
2b sequence was present in several pools. One of the
pools that produced the correct size PCR product encoding
the 3' sequence of the 2b subtype was diluted and
screened with radiolabeled ZC4519 and ZC4520 (Table 4).
Phage that hybridize to both radiolabeled ZC4519 and
ZC4520 are picked, eluted, diluted, plated and rescreened
with the oligonucleotide probes. The screening is
repeated until a pure clone is obtained. The pure clone
is sequenced, and a full-length clone is constructed
using the most convenient restriction enzyme(s).

Based on an alignment of the deduced amino acid
sequences of subtypes 1a and 2a, strategies were designed
for cloning additional subtypes using PCR amplification.
Degenerate oligonucleotide families were prepared to
encode conserved amino acid sequences in the sixth
transmembrane domain, a region surrounding the conserved
amino acid sequence Phe-Asp-Glu-Lys, the third
cytoplasmic loop, and the second transmembrane domain
(Table 4).
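
The size of a degenerate oligonucleotide family, and the relationship between sense and antisense families, can be illustrated with the short sketch below. It uses the standard IUPAC nucleotide ambiguity codes and the ZC4560, ZC4561 and ZC4562 sequences as printed in Table 4; the function names and the treatment of the first ten nucleotides as the 5' tail are choices made for this example.

```python
# Standard IUPAC nucleotide codes and the bases each one stands for.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "K": "GT", "M": "AC", "S": "CG", "W": "AT",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}

# Complement of each IUPAC code (the code for the complementary base set).
COMPLEMENT = {
    "A": "T", "T": "A", "C": "G", "G": "C",
    "R": "Y", "Y": "R", "K": "M", "M": "K", "S": "S", "W": "W",
    "B": "V", "V": "B", "D": "H", "H": "D", "N": "N",
}

TAIL_LENGTH = 10  # 5' tail carrying the restriction site


def clean(primer):
    """Remove the triplet spacing used in Table 4 and upper-case the sequence."""
    return primer.replace(" ", "").upper()


def degeneracy(primer):
    """Number of distinct sequences in a degenerate oligonucleotide family."""
    count = 1
    for base in clean(primer):
        count *= len(IUPAC[base])
    return count


def reverse_complement(primer):
    """Reverse complement of a sequence written with IUPAC codes."""
    return "".join(COMPLEMENT[base] for base in reversed(clean(primer)))


ZC4560 = "ATA AGA ATT CTT YRA YGA RAA NGG NGA YGC"
ZC4561 = "ATA AGA ATT CGC NGG NAT HTT YYT NKG NTA"
ZC4562 = "ATA AGA ATT CTA NCM NAR RAA DAT NCC NGC"

print(degeneracy(ZC4560))  # 512 sequences in the ZC4560 family

# Without their tails, ZC4562 is the reverse complement of ZC4561,
# so the two families target the same region on opposite strands.
core_4561 = clean(ZC4561)[TAIL_LENGTH:]
core_4562 = clean(ZC4562)[TAIL_LENGTH:]
print(reverse_complement(core_4561) == core_4562)  # True
```

After the tail, ZC4560 reads TTY RAY GAR AAN GGN GAY GC, which covers the codons for the conserved Phe-Asp-Glu-Lys sequence named above.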

Glutamate receptor cDNA sequences were amplified
with pairs of degenerate primers from Table 4 using the
PCR method on cDNA from the total rat brain library, the
cDNA from the rat cerebellum library, a rat cortex cDNA
library or a rat hippocampus cDNA library (both obtained
from Michael Brownstein, National Institutes of Health,
Bethesda, MD). The primers also each contained a 5' tail
of 10 nucleotides, which provided convenient restriction
enzyme sites. For each PCR reaction, ten nanograms from
the library and 100 pmol of the oligonucleotide pools
ZC4563 and ZC4560 (Table 4) were combined in a reaction
volume of 50 µl containing 50 mM KCl, 10 mM Tris-HCl pH
9.0, 1.5 mM MgCl2, 0.1% Triton X-100, 0.01% gelatin, 0.2
mM each deoxynucleotide triphosphate and 2.5 units of Taq
DNA polymerase. The reaction mixture was overlaid with
mineral oil. After five cycles (30 seconds at 94°C, 30
seconds at 45°C, 1 minute at 50°C) and twenty-five cycles
(30 seconds at 94°C, 30 seconds at 45°C, 1 minute at
72°C) the amplified DNA was removed for analysis.
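
The two-stage cycling program described above can be written down as data, which makes the cycle count and total programmed hold time easy to check. This is a sketch only: the class and function names are introduced for the example, and ramp times between temperatures, which depend on the instrument, are not included.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    temp_c: float   # block temperature in degrees Celsius
    seconds: int    # hold time at that temperature


@dataclass(frozen=True)
class Stage:
    cycles: int
    steps: tuple    # tuple of Step, run in order once per cycle


# Five low-stringency cycles followed by twenty-five standard cycles.
PROGRAM = (
    Stage(5, (Step(94, 30), Step(45, 30), Step(50, 60))),
    Stage(25, (Step(94, 30), Step(45, 30), Step(72, 60))),
)


def total_cycles(program):
    """Total number of cycles across all stages."""
    return sum(stage.cycles for stage in program)


def total_hold_seconds(program):
    """Total programmed hold time, excluding temperature ramps."""
    return sum(
        stage.cycles * sum(step.seconds for step in stage.steps)
        for stage in program
    )


print(total_cycles(PROGRAM))               # 30
print(total_hold_seconds(PROGRAM) // 60)   # 60 minutes of hold time
```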

An aliquot from each reaction was electrophoresed
on an agarose gel. Southern analysis of the gel was
performed using essentially the method described by
Sambrook et al. (ibid.) and random-primed fragments
covering the entire coding regions from both the subtype
1a and 2a clones. The autoradiographs showed that the
PCR reaction generated fragments of novel size that were
different from either the 1a or 2a subtype. The
PCR-generated fragments were electrophoresed on an
agarose gel. Regions corresponding to the unique-sized
receptor-related products were excised and
electrophoresed onto NA45 paper (Schleicher and Schuell,
Keene, NH). The purified fragments were recovered using
essentially the method described by the manufacturer,
digested with Eco RI and ligated to plasmid pVEGT' that
had been linearized by digestion with Eco RI and treated
with phosphatase to prevent recircularization. The
ligation mixtures were transformed into E. coli strain
DH10B cells. Transformants were picked and replica
plated onto nitrocellulose filters and screened using
random-primed probes from the 1a and the 2a clones.
Forty-eight colonies were picked for restriction analysis
and sequencing.

DNA sequences from the cDNA from the total rat
brain library and the cDNA from the rat cerebellum
library were each amplified and analyzed using the
methods described above and oligonucleotide ZC4559 in
combination with either ZC4561 or ZC4559 (Table 4).

A rat cortex cDNA library and a rat hippocampus
cDNA library (both obtained from Michael Brownstein, NIH)
are subdivided into 30 pools of 10,000 colonies. Plasmid
DNA is prepared from each pool, and the DNA is subjected
to Southern analysis after restriction digestion of the
pools with Bam HI and Xho I or by PCR amplification of
each pool using the degenerate oligonucleotides of Table
4. The library pools containing DNA that hybridizes to
the probes and appears to contain a full-length cDNA are
subdivided. The plasmid DNA is prepared and screened as
described above. Positive pools are again divided and
the procedure is continued until the pool is reduced to
pure clones. The clones are subjected to restriction
analysis and partial sequence analysis. Clones that
represent distinct glutamate receptor homologs are
completely sequenced. Full length clones are generated
by subjecting the original pools to PCR amplification
using an oligonucleotide primer specific to the SP6
promoter at the 5' end of the cDNA insert and an
antisense oligonucleotide primer corresponding to the 5'
end of the most complete cDNA to identify pools that
contain the longest glutamate receptor homolog cDNA. The
pool is then diluted and rehybridized with the probes as
described above to isolate a full length cDNA clone.

Expression of Glutamate Receptor Subtypes

Complementary DNA sequences encoding subtypes 1b
and 2a were subcloned first into the mammalian expression
vector Zem228R to obtain convenient terminal restriction
sites. The cDNAs were then subcloned into pVEGT'. The
cDNA sequence encoding subtype 1b was constructed by
replacing the 3' terminal portion of subtype 1a described
in Example I with the analogous portion of subtype 1b
from SN23. Plasmid SN23 was digested with Kpn I and Xba
I to isolate the fragment containing the 3' terminus of
the 1b subtype. The plasmid containing the subtype 1a
coding sequence (45-A) in Zem228R was digested with Kpn I
and Xba I to isolate the vector-containing fragment. The
vector-containing fragment was ligated to the Kpn I-Xba I
fragment from SN23. The resulting plasmid comprises the
MT-1 promoter, the subtype 1b cDNA and the hGH
terminator. This plasmid was transfected into the BHK
570 cell line essentially as described in Example I to
obtain stably transfected cell lines expressing the
subtype 1b receptor. The subtype 1b cDNA fragment was
isolated as a Bam HI fragment, which was ligated with
pVEGT' that had been linearized with Bam HI. A plasmid
containing the cDNA sequence in the correct orientation
was used to synthesize RNA in an in vitro system. The
RNA was injected into oocytes as described above.

Plasmid SN30, which comprises the subtype 2a cDNA,
was digested with Eco RI to isolate the subtype 2a cDNA.
The Eco RI fragment was ligated with Eco RI-linearized
Zem228R. A plasmid containing the insert in the correct
orientation was digested with Bam HI to isolate the cDNA
sequence. The Bam HI fragment comprising the subtype 2a
cDNA was ligated with Eco RI-linearized pVEGT'. A
plasmid containing the cDNA in the correct orientation
was used to synthesize RNA in an in vitro system.
The RNA was injected into frog oocytes as described
above.






EXAMPLE III 

Generation of antibodies to glutamate receptor subtypes

Receptor subtype-specific polyclonal antisera
were generated in rabbits using standard immunization
techniques. Synthetic peptides (Table 5) were designed
from the cloned receptor sequences. The peptides were
conjugated to keyhole limpet hemocyanin, and each antigen 
was used to immunize two animals. For each peptide, the 
animals were injected with 100-200 of conjugated 
peptide divided among three subcutaneous sites. The 
animals were immunized at three-week intervals and bled 
via an ear vein 10 days after the third and subsequent 
immunizations. 



Table 5

Subtype   Seq. ID No.   Peptide Sequence      Apparent Location

1a        21            RDSLISIRDEKDGLNRC     extracellular
          22            DRLLRKLRERLPKARV      extracellular
          23            EEVWFDEKGDAPGRYD      extracellular
          24            EFVYEREGNTEEDEL       cytoplasmic
          25            PERKCCEIREQYGIQRV     extracellular
          26            IGPGSSSVAIQVQNLL      extracellular
          27            IAYSATSIDLSDKTL       extracellular
1b        28            KKPGAGNAKKRQPEFS      cytoplasmic
          29            PEFSPSSQCPSAHAQL      cytoplasmic
2a        30            DKIIKRLLETSNARG       extracellular
          31            VNFSGIAGNPVTFNEN      extracellular
          32            GEAKSELCENLETPAL      cytoplasmic
2b        33            PARLALPANDf EFSAWV    cytoplasmic

Anti-peptide antibodies were purified by affinity
purification using the Proton™ Kit (Multiple Peptide
Systems, San Diego, CA). Purified antibodies were stored
in column elution buffer and neutralizing buffer
(supplied by Multiple Peptide Systems). Bovine serum
albumin was added to a concentration of 1 mg/ml, and
sodium azide was added to a concentration of 0.05%. The
antibodies were stored at 4°C or in small aliquots at
-20°C.

Antibodies generated from the peptides listed in
Table 6 were used to detect G protein-coupled glutamate
receptors by Western blot analysis of membranes prepared
from transfected cell lines that were stably expressing
the subtype 1a or subtype 1b receptors. Control cell
lines were transfected with vector alone.

Table 6

Analysis of Antibodies Raised to Peptides

Antibodies to        Seq. ID
Peptide Sequence     No.       Location        Western

RDSLISIRDEKDGLNRC    21        extracellular   +++ with bkgd
DRLLRKLRERLPKARV     22        extracellular   +
EEVWFDEKGDAPGRYD     23        extracellular   ++++ low bkgd
EFVYEREGNTEEDEL      24        cytoplasmic     ++++ low bkgd
KKPGAGNAKKRQPEFS     28        cytoplasmic     + for 1a
                                               - for 1b
PEFSPSSQCPSAHAQL     29        cytoplasmic     +++ for 1b low bkgd


Transfectants that were stably expressing either
the 1a or 1b subtype were each grown to confluency in
five to ten 150 mm plates. Each plate was first washed
twice with 15 ml of cold PBS and then 20 ml of ice cold
10 mM NaHCO3 was added to each plate. The cells from
each plate were scraped off the plates with a rubber
spatula and transferred to a glass dounce homogenizer on
ice. The cells were disrupted with ten strokes of the B
pestle. The homogenates from each plate were combined
and centrifuged for thirty minutes at 3000 rpm at 4°C.
The pellets were resuspended in 4-8 ml of 10 mM NaHCO3
using a 22 g needle and syringe, and 69% sucrose was
added (6-12 ml) to each sample until an index of
refraction of 1.410 was reached. The samples were
transferred to a high speed centrifugation tube, and each
sample was overlayed with 42% sucrose. The samples were
centrifuged for two hours at 25,000 rpm at 4°C. The
samples were collected by gently floating the membranes
off the 42% sucrose layer by adding 1 ml of 10 mM NaHCO3
and resuspending the membranes by carefully stirring the
upper layer. The upper layer was transferred to a fresh
tube on ice. The purified membranes were centrifuged at
10,000 rpm at 4°C and the pellets resuspended in 10 mM
NaHCO3. The purified membranes were then adjusted to a
final protein concentration of 1-2 ng/ml.

Ten to twenty micrograms of each purified membrane
preparation were diluted with 2x SDS-mercaptoethanol
buffer (100 mM Tris HCl (pH 6.8), 200 mM dithiothreitol,
4% SDS, 0.2% bromophenol blue, 20% glycerol). The
samples were incubated for 15 minutes at 37°C followed by
boiling for 5 minutes. The samples were subjected to
SDS-PAGE on a 4-15% gradient gel. The samples were
electrotransferred to nitrocellulose using essentially
the method described by Towbin (Proc. Natl. Acad. Sci.
USA 76: 4350-4354, 1979; which is incorporated herein by
reference in its entirety). After transfer, the
nitrocellulose was cut into strips such that each strip
contained a control and receptor samples. The
nitrocellulose was preincubated in blocking buffer and
then incubated with a dilution of either the preimmune
serum or the serum collected after antigenic stimulation
(serum from later bleeds (i.e. those after four antigen
stimulations) was diluted 1:1500). After washing, a
horse radish peroxidase-conjugated goat anti-rabbit
antibody (Bio-Rad Laboratories, Richmond, CA) diluted
1:2,500 was added and after incubation and washing, the
horse radish peroxidase substrate (Bio-Rad Laboratories)
was added and the color reaction was initiated. The
reaction was stopped by rinsing the filters in distilled
water. Table 6 shows the results of the Western blot
analysis.

SEQUENCE LISTING

(1) GENERAL INFORMATION:

(i) APPLICANT: Mulvihill, Eileen R.
Hagen, Frederick S.
Houamed, Khaled M.
Almers, Wolfhard

(ii) TITLE OF INVENTION: G PROTEIN-COUPLED GLUTAMATE RECEPTORS

(iii) NUMBER OF SEQUENCES: 33

(iv) CORRESPONDENCE ADDRESS:

(A) ADDRESSEE: Townsend and Townsend

(B) STREET: One Market Plaza, Steuart Street Tower

(C) CITY: San Francisco

(D) STATE: California

(E) COUNTRY: USA

(F) ZIP: 94105-1492

(v) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: Floppy disk

(B) COMPUTER: IBM PC compatible

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25

(vi) CURRENT APPLICATION DATA:

(A) APPLICATION NUMBER:

(B) FILING DATE:

(C) CLASSIFICATION:

(vii) PRIOR APPLICATION DATA:

(A) APPLICATION NUMBER: US 07/672,007

(B) FILING DATE: 18-MAR-1991

(vii) PRIOR APPLICATION DATA:

(A) APPLICATION NUMBER: US 07/648,481

(B) FILING DATE: 30-JAN-1991

(vii) PRIOR APPLICATION DATA:

(A) APPLICATION NUMBER: US 07/626,806

(B) FILING DATE: 12-DEC-1990

(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: Parmelee, Steven W.

(B) REGISTRATION NUMBER: 31,990

(C) REFERENCE/DOCKET NUMBER: 13952-6PC

(ix) TELECOMMUNICATION INFORMATION:

(A) TELEPHONE: (206) 467-9600

(B) TELEFAX: (415) 543-5043

(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4300 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(vii) IMMEDIATE SOURCE:

(B) CLONE: 45-A

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 377..3973

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:
CCGAGAACGG CTGCAGTCCT CTGACCTGAG ACCAATAGCT GTGTCTACCC GGACTCAGCG 
TCCAGCTCAC CGCCACTAAC GCBCCGCGCA TTGGACACCT GATCCACACA CCTTCGGGCA 
CCAGTGAAAA ACCGCGACTT GATTTTCTGG AAGAACGCCC CCAGGGTGTG GGAGCGGTCG 
TGGAGGACCA GCAGGAGGAA GCGGAGGGGA GAGGGGCAGT AGTGGAGGCA GAGAAAGCGT 
TGAACCAGCT GTGTTGGCCG AAGGCACGAA ACGGCAAAAG GCAGCGGTGA GCATCTGTGT 
C^CCCGCT GGGAACCTGC AGGCAGGACC GGCGTGGGAA CGTGGCTGGC CCGCGGTGGA 
CCGCGTCTTC GCCACA ATG GTC CGG CTC CTC TTG ATT TTC TTC CCA ATG 

s s s s c e ssia s £ s s & £ - - 

-M BS.SS-K a?' BB SB ftS .5 S SS.BJ s 
~ S S 2S S S £ S £ 2S S S! S S! £ 




60 
120 
180 
240 
300 
360 
409 

457 

505 

553 

601 

649 

697 
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100 105 



„,„„ -v^ m rr rJk r TCT TCA GTG GCT CTC GAA CAG AGC ATC GAA TTC ATC 
S SS-S! 8* IS IS 5S Ala Leu Glu Gin Ser lie Glu Phe lie 



■asfe as a« s tg s = ~ £ i » - « 

190 195 -. 200 .. 

GTG GTC CCT TCT GAC ACT TTG CAG GCA ™ «fi' jKlS !E S S5 • 
val val Pro Ser Asp Thr Leu Gin Ala Arg Ala Met Leu Asp lie vaj. 
205 210 ,215 

is a ss as s? a s £ a a si as 

220 225 

£ S S S iS 5 E-2 SS S?'S S S S5 

240. 245 

S5 S £ if. Si S a ji'S SS 

255* 260 

■as aa si a 2s a a a as g a a sr 



asj s - a gi as a ss a a a a 

ssss ss25 is as a a sr a ss 



sassssassaassssis 

320 329 




745 



110 I" 

irA GAC TCC CTG ATT TCC ATC CGA GAT GAG AAG GAT GGG CTG AAC CGA 
£g Sp S 2u 'iS Ser lie Arg Asp Glu Lys Asp Gly Leu Asn Arg 
125 I 30 ■ 135 

„ rr rm- CCT GAT GGC CAG ACC CTG CCC CCT GGC AGG ACT AAG AAG CCT 
cys 2u S»2J S, Sn Thr Leu Pro Pro Gly Arg Thr Lys Lys Pro 
140 145 1S0 

ATT GCT GGA GTG ATC GGC CCT GGC TCC AGC TCT GTG GCC ATT CAA GTC 
S S gU SS Gly Pro Gly Ser Ser Ser Val Ala lie Gin Val 

160 1«5 x/w 

CAS : aXT CTT CTC CAG CTG TTC GAC ATC CCA CAG ATC GCC. TAT TCT GCC 937 

Sn Ata 2u 25 Sn Leu Phe Asp lie Pro Gin lie Ala Tyr Ser Ala 

175 180 



793 



841 



889 



985 



1033 



1081 



1129 



1177 



1225 



1273 



1321 



1369 
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„ GAA CCC 4c GGA CCC ATC ACA £ £ CTT CAG TCT CCA GAG CTC 

Val Glu Ala Asn Gly Gly He Thr lie uys ^ 
335 34U 



saassaaaasaasaaaa 

365 370 

. _ ___ A rr A «ar TTT AAG AAA GTG TGC ACA 

CTA CCT GGA CAC CTC TTG GAA AAC CCC AAC TTT AAG . ^ T hr 
Leu Pro Gly His Leu Leu Glu Asn Pro Asn roe uys »x . j jg5 

r «» rai AAC TAT GTC CAG GAC AGC AAA ATG GGA 

:JS a SS «i a J2£w jg «- »p s sl * 

400 4UD 

■'saias ii« Ty? sa s a'as a-a a- 

415 420 
430 435 

a s-s ife is a s as a a as = a = 

445 450 

S »S IS S.S! a a a"a§ - a sss 

,460 ! 465 

CCC GGA AGG TAT GAC ATT ATG AAT CTG CAG TAC ^ ^ JCT AAT CGC 
Pro Gly Arg Tyr Asp He Met Asn Leu Gin Tyr «. 490 

. : 480 * ao 

• *w aee TGG CAT GAA GGA GTG CTG AAT ATT 

s a a srss a ss a. a «- » - n * 

495 

^sa^s-Sar^S"® 
ss'aij sss »■* ass ass a a si 



S2S 530 



w MM 4<" s « a a a a sMSfi a a 

Glu val Ser Cyp Cys Trp He Cys Tnr Axa g» 555 



540 545 



» CAS GAC OA* TTC ACC TGC AGA GCC TGT GAC CTC GGG TGG TGC COG 




1417 



1465 



1513 



1561 



1609 



1657 



1705 



1753 



1801 



1849 



1897 



1945 



1993 



2041 



2089 
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640 



CCT ACT ACC ACA TCC TGC TAC CTC CAG CGC CTC CTA GTT GGC CTC TCP 
So tS -.JS:*S..S. Ss TyrlLeu Gln.Arg Leu Leu Val . Gly Leu Ser 
. 660 663 



655 



TCT GCC ATG TGC TAC TCT GCT TTA GTG ACC AAA ACC AAT CGT ATT GCA 2425
Ser Ala Met Cys Tyr Ser Ala Leu Val Thr Lys Thr Asn Arg Ile Ala
670 675 680

CGC ATC CTG GCT GGC AGC AAG AAG AAG ATC TGC ACC CGG AAG CCC AGA 2473
Arg Ile Leu Ala Gly Ser Lys Lys Lys Ile Cys Thr Arg Lys Pro Arg
685 690 695

TTC ATG AGC GCT TGG GCC CAA GTG ATC ATA GCC TCC ATT CTG ATT AGT 2521
Phe Met Ser Ala Trp Ala Gln Val Ile Ile Ala Ser Ile Leu Ile Ser
700 705 710 715



rri CAS CTA ACA CTA GTG GTG ACC, TTG ATC ATC ATG GAG CCT CCC ATG 
?£ Gin 2S T^ SS v£ val Thr Leu He He Met Glu Pro Pro Met 

725 . 



720 



CCC ATT TTG TCC TAC CCG AGT ATC AAG GAA GTC TAC CTT ATC TGC AAT 2617
Pro Ile Leu Ser Tyr Pro Ser Ile Lys Glu Val Tyr Leu Ile Cys Asn
735 740 745

as«« s £ S? SlB SS 3! a i fee « a 

765 770 7/9 

AAC TTC AAT GAG GCT AAA TAC ATC GCC TTC ACC ATG TAC ACT ACC TGC 2761
Asn Phe Asn Glu Ala Lys Tyr Ile Ala Phe Thr Met Tyr Thr Thr Cys



Val Gln Asp Glu Phe Thr Cys Arg Ala Cys Asp Leu Gly Trp Trp Pro
560 565 570

AAC GCA GAG CTC ACA GGC TGT GAG CCC ATT CCT GTC CGT TAT CTT GAG 2137
Asn Ala Glu Leu Thr Gly Cys Glu Pro Ile Pro Val Arg Tyr Leu Glu
575 580 585

TGG AGT GAC ATA GAA TCT ATC ATA GCC ATC GCC TTT TCT TGC CTG GGC 2185
Trp Ser Asp Ile Glu Ser Ile Ile Ala Ile Ala Phe Ser Cys Leu Gly
590 595 600

ATC CTC GTG ACG CTG TTT GTC ACC CTC ATC TTC GTT CTG TAC CGG GAC 2233
Ile Leu Val Thr Leu Phe Val Thr Leu Ile Phe Val Leu Tyr Arg Asp
605 610 615

ACA CCC GTG GTC AAA TCC TCC AGT AGG GAG CTC TGC TAT ATC ATT CTG 2281
Thr Pro Val Val Lys Ser Ser Ser Arg Glu Leu Cys Tyr Ile Ile Leu
620 625 630 635

GCT GGT ATT TTC CTC GGC TAT GTG TGC CCT TTC ACC CTC ATC GCC AAA 2329
Ala Gly Ile Phe Leu Gly Tyr Val Cys Pro Phe Thr Leu Ile Ala Lys
640 645 650
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780- 785 790 795 

ATC ATC TGG CTG GCT TTC GTT CCC ATT TAC TTT GGG AGC AAC TAC AAG 2809
Ile Ile Trp Leu Ala Phe Val Pro Ile Tyr Phe Gly Ser Asn Tyr Lys
800 805 810

ATC ATC ACT ACC TGC TTC GCG GTG AGC CTC AGT GTG ACG GTG GCC CTG 2857
Ile Ile Thr Thr Cys Phe Ala Val Ser Leu Ser Val Thr Val Ala Leu
815 820 825

GGG TGC ATG TTT ACT CCG AAG ATG TAC ATC ATC ATT GCC AAA CCT GAG 2905
Gly Cys Met Phe Thr Pro Lys Met Tyr Ile Ile Ile Ala Lys Pro Glu
830 835 840

AGG AAC GTC CGC AGT GCC TTC ACG ACC TCT GAT GTT GTC CGC ATG CAC 2953
Arg Asn Val Arg Ser Ala Phe Thr Thr Ser Asp Val Val Arg Met His
845 850 855

GTC GGT GAT GGC AAA CTG CCG TGC CGC TCC AAC ACC TTC CTC AAC ATT 3001
Val Gly Asp Gly Lys Leu Pro Cys Arg Ser Asn Thr Phe Leu Asn Ile
860 865 870 875

TTC CGG AGA AAG AAG CCC GGG GCA GGG AAT GCC AAT TCT AAC GGC AAG 3049
Phe Arg Arg Lys Lys Pro Gly Ala Gly Asn Ala Asn Ser Asn Gly Lys
880 885 890

TCT GTG TCA TGG TCT GAA CCA GGT GGA AGA CAG GCG CCC AAG GGA CAG 3097
Ser Val Ser Trp Ser Glu Pro Gly Gly Arg Gln Ala Pro Lys Gly Gln
895 900 905

CAC GTG TGG CAG CGC CTC TCT GTG CAC GTG AAG ACC AAC GAG ACG GCC 3145
His Val Trp Gln Arg Leu Ser Val His Val Lys Thr Asn Glu Thr Ala
910 915 920

TGT AAC CAA ACA GCC GTA ATC AAA CCC CTC ACT AAA AGT TAC CAA GGC 3193
Cys Asn Gln Thr Ala Val Ile Lys Pro Leu Thr Lys Ser Tyr Gln Gly
925 930 935

TCT GGC AAG AGC CTG ACC TTT TCA GAT GCC AGC ACC AAG ACC CTT TAC 3241
Ser Gly Lys Ser Leu Thr Phe Ser Asp Ala Ser Thr Lys Thr Leu Tyr
940 945 950 955

AAT GTG GAA GAA GAG GAC AAT ACC CCT TCT GCT CAC TTC AGC CCT CCC 3289
Asn Val Glu Glu Glu Asp Asn Thr Pro Ser Ala His Phe Ser Pro Pro
960 965 970

AGC AGC CCT TCT ATG GTG GTG CAC CGA CGC GGG CCA CCC GTG GCC ACC 3337
Ser Ser Pro Ser Met Val Val His Arg Arg Gly Pro Pro Val Ala Thr
975 980 985

ACA CCA CCT CTG CCA CCC CAT CTG ACC GCA GAA GAG ACC CCC CTG TTC 3385
Thr Pro Pro Leu Pro Pro His Leu Thr Ala Glu Glu Thr Pro Leu Phe
990 995 1000

CTG GCT GAT TCC GTC ATC CCC AAG GGC TTG CCT CCT CCT CTC CCG CAG 3433
Leu Ala Asp Ser Val Ile Pro Lys Gly Leu Pro Pro Pro Leu Pro Gln
1005 1010 1015
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as-as s as as s a a as as a a as as ss, 

1020 1025 

SS SB s as a ss ss s ©ss ss ss s ss 

S S S S SS a 2S £55 s ss s as £ 

1055 1060 
1070 1075 

..5 a ss a ss a as ss as as as a a a 
a ss a ss if. a s ss .a 'a sss ss a ss s 

noo i 105 111 

as a a a a a a ss aa a 2? a. a a. 

1120 1125 

GAG GAG GAG GAC CTG CCC ACA GCC AGC AAG CTG ACC CCT GAG GAT TCT
Glu Glu Glu Asp Leu Pro Thr Ala Ser Lys Leu Thr Pro Glu Asp Ser
1135 1140 1145

a a a s a a a a a a a a as a ss 

1150 1155 

a a a a a sss a as a ss s s a 
a ss ass a sss a gas's as a 

1180 i" 5 119 

TCT TCC ACC CTG TAGTGTGTGT GTGTGTGTGG GGGCGGGGGG AGTGCGCATG 4013
Ser Ser Thr Leu

GAGAAGCCAG AGATGCCAAG GAGTGTCAAC CCTTCCAGAA ATGTGTAGAA AGCAGGGTGA 4073
GGGATGGGGA TGGAGGACCA CGGTCTGCAG GGAAGAAAAA AAAAATGCTG CGGCTGCCTT 4133
AAAGAAGGAG AGGGACGATG CCAACTGAAC AGTGGTCCTG GCCAGGATTG TGACTCTTGA 4193
ATTATTCAAA AACCTTCTCT AGAAAGAAAG GGAATTATGA CAAAGCACAA TTCCATATGG 4253
TATGTAACTT TTATCGAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAA 4300
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(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1199 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Val Arg Leu Leu Leu Ile Phe Phe Pro Met Ile Phe Leu Glu Met
1 5 10 15

Ser Ile Leu Pro Arg Met Pro Asp Arg Lys Val Leu Leu Ala Gly Ala
20 25 30

Ser Ser Gln Arg Ser Val Ala Arg Met Asp Gly Asp Val Ile Ile Gly
35 40 45

Ala Leu Phe Ser Val His His Gln Pro Pro Ala Glu Lys Val Pro Glu
50 55 60

Arg Lys Cys Gly Glu Ile Arg Glu Gln Tyr Gly Ile Gln Arg Val Glu
65 70 75 80

Ala Met Phe His Thr Leu Asp Lys Ile Asn Ala Asp Pro Val Leu Leu
85 90 95

Pro Asn Ile Thr Leu Gly Ser Glu Ile Arg Asp Ser Cys Trp His Ser
100 105 110

Ser Val Ala Leu Glu Gln Ser Ile Glu Phe Ile Arg Asp Ser Leu Ile
115 120 125

Ser Ile Arg Asp Glu Lys Asp Gly Leu Asn Arg Cys Leu Pro Asp Gly
130 135 140

Gln Thr Leu Pro Pro Gly Arg Thr Lys Lys Pro Ile Ala Gly Val Ile
145 150 155 160

Gly Pro Gly Ser Ser Ser Val Ala Ile Gln Val Gln Asn Leu Leu Gln
165 170 175

Leu Phe Asp Ile Pro Gln Ile Ala Tyr Ser Ala Thr Ser Ile Asp Leu
180 185 190

Ser Asp Lys Thr Leu Tyr Lys Tyr Phe Leu Arg Val Val Pro Ser Asp
195 200 205

Thr Leu Gln Ala Arg Ala Met Leu Asp Ile Val Lys Arg Tyr Asn Trp
210 215 220

Thr Tyr Val Ser Ala Val His Thr Glu Gly Asn Tyr Gly Glu Ser Gly
225 230 235 240
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Me* Asp Ala Phe Lys Glu Leu Ala Ala Gin Glu Gly Leu Cys lie Ala 



245 250 255

His Ser Asp Lys Ile Tyr Ser Asn Ala Gly Glu Lys Ser Phe Asp Arg
260 265 270

Leu Leu Arg Lys Leu Arg Glu Arg Leu Pro Lys Ala Arg Val Val Val
275 280 285

Cys Phe Cys Glu Gly Met Thr Val Arg Gly Leu Leu Ser Ala Met Arg
290 295 300

Arg Leu Gly Val Val Gly Glu Phe Ser Leu Ile Gly Ser Asp Gly Trp
305 310 315 320

Ala Asp Arg Asp Glu Val Ile Glu Gly Tyr Glu Val Glu Ala Asn Gly
325 330 335

Gly Ile Thr Ile Lys Leu Gln Ser Pro Glu Val Arg Ser Phe Asp Asp
340 345 350

Tyr Phe Leu Lys Leu Arg Leu Asp Thr Asn Thr Arg Asn Pro Trp Phe
355 360 365

Pro Glu Phe Trp Gln His Arg Phe Gln Cys Arg Leu Pro Gly His Leu
370 375 380

Leu Glu Asn Pro Asn Phe Lys Lys Val Cys Thr Gly Asn Glu Ser Leu
385 390 395 400

Glu Glu Asn Tyr Val Gln Asp Ser Lys Met Gly Phe Val Ile Asn Ala
405 410 415

Ile Tyr Ala Met Ala His Gly Leu Gln Asn Met His His Ala Leu Cys
420 425 430

Pro Gly His Val Gly Leu Cys Asp Ala Met Lys Pro Ile Asp Gly Arg
435 440 445

Lys Leu Leu Asp Phe Leu Ile Lys Ser Ser Phe Val Gly Val Ser Gly
450 455 460

Glu Glu Val Trp Phe Asp Glu Lys Gly Asp Ala Pro Gly Arg Tyr Asp
465 470 475 480

Ile Met Asn Leu Gln Tyr Thr Glu Ala Asn Arg Tyr Asp Tyr Val His
485 490 495

Val Gly Thr Trp His Glu Gly Val Leu Asn Ile Asp Asp Tyr Lys Ile
500 505 510

Gln Met Asn Lys Ser Gly Met Val Arg Ser Val Cys Ser Glu Pro Cys
515 520 525

Leu Lys Gly Gln Ile Lys Val Ile Arg Lys Gly Glu Val Ser Cys Cys
530 535 540
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Trp-Il. =y Th r ai. cy. n» «• *» «* ^ ^ sl " "» JS' 

545 550 555 560

Thr Cys Arg Ala Cys Asp Leu Gly Trp Trp Pro Asn Ala Glu Leu Thr
565 570 575

Gly Cys Glu Pro Ile Pro Val Arg Tyr Leu Glu Trp Ser Asp Ile Glu
580 585 590

Ser Ile Ile Ala Ile Ala Phe Ser Cys Leu Gly Ile Leu Val Thr Leu
595 600 605

Phe Val Thr Leu Ile Phe Val Leu Tyr Arg Asp Thr Pro Val Val Lys
610 615 620

Ser Ser Ser Arg Glu Leu Cys Tyr Ile Ile Leu Ala Gly Ile Phe Leu
625 630 635 640

Gly Tyr Val Cys Pro Phe Thr Leu Ile Ala Lys Pro Thr Thr Thr Ser
645 650 655

Cys Tyr Leu Gln Arg Leu Leu Val Gly Leu Ser Ser Ala Met Cys Tyr
660 665 670

Ser Ala Leu Val Thr Lys Thr Asn Arg Ile Ala Arg Ile Leu Ala Gly
675 680 685

Ser Lys Lys Lys Ile Cys Thr Arg Lys Pro Arg Phe Met Ser Ala Trp
690 695 700

Ala Gln Val Ile Ile Ala Ser Ile Leu Ile Ser Val Gln Leu Thr Leu
705 710 715 720

Val Val Thr Leu Ile Ile Met Glu Pro Pro Met Pro Ile Leu Ser Tyr
725 730 735

Pro Ser Ile Lys Glu Val Tyr Leu Ile Cys Asn Thr Ser Asn Leu Gly
740 745 750

Val Val Ala Pro Val Gly Tyr Asn Gly Leu Leu Ile Met Ser Cys Thr
755 760 765

Tyr Tyr Ala Phe Lys Thr Arg Asn Val Pro Ala Asn Phe Asn Glu Ala
770 775 780

Lys Tyr Ile Ala Phe Thr Met Tyr Thr Thr Cys Ile Ile Trp Leu Ala
785 790 795 800

Phe Val Pro Ile Tyr Phe Gly Ser Asn Tyr Lys Ile Ile Thr Thr Cys
805 810 815

Phe Ala Val Ser Leu Ser Val Thr Val Ala Leu Gly Cys Met Phe Thr
820 825 830

Pro Lys Met Tyr Ile Ile Ile Ala Lys Pro Glu Arg Asn Val Arg Ser
835 840 845
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Ala Phe Thr Thr Ser Asp Val Val Arg Met His Val Gly Asp Gly Lys 
850 855 860 

Leu Pro Cys Arg Ser Asn Thr Phe Leu Asn Ile Phe Arg Arg Lys Lys
865 870 875 880

Pro Gly Ala Gly Asn Ala Asn Ser Asn Gly Lys Ser Val Ser Trp Ser
885 890 895

Glu Pro Gly Gly Arg Gln Ala Pro Lys Gly Gln His Val Trp Gln Arg
900 905 910

Leu Ser Val His Val Lys Thr Asn Glu Thr Ala Cys Asn Gln Thr Ala
915 920 925

Val Ile Lys Pro Leu Thr Lys Ser Tyr Gln Gly Ser Gly Lys Ser Leu
930 935 940

Thr Phe Ser Asp Ala Ser Thr Lys Thr Leu Tyr Asn Val Glu Glu Glu
945 950 955 960

Asp Asn Thr Pro Ser Ala His Phe Ser Pro Pro Ser Ser Pro Ser Met
965 970 975

Val Val His Arg Arg Gly Pro Pro Val Ala Thr Thr Pro Pro Leu Pro
980 985 990

Pro His Leu Thr Ala Glu Glu Thr Pro Leu Phe Leu Ala Asp Ser Val
995 1000 1005

Ile Pro Lys Gly Leu Pro Pro Pro Leu Pro Gln Gln Gln Pro Gln Gln
1010 1015 1020

Pro Pro Pro Gln Gln Pro Pro Gln Gln Pro Lys Ser Leu Met Asp Gln
1025 1030 1035 1040

Leu Gln Gly Val Val Thr Asn Phe Gly Ser Gly Ile Pro Asp Phe His
1045 1050 1055

Ala Val Leu Ala Gly Pro Gly Thr Pro Gly Asn Ser Leu Arg Ser Leu
1060 1065 1070

Tyr Pro Pro Pro Pro Pro Pro Gln His Leu Gln Met Leu Pro Leu His
1075 1080 1085

Leu Ser Thr Phe Gln Glu Glu Ser Ile Ser Pro Pro Gly Glu Asp Ile
1090 1095 1100

Asp Asp Asp Ser Glu Arg Phe Lys Leu Leu Gln Glu Phe Val Tyr Glu
1105 1110 1115 1120

Arg Glu Gly Asn Thr Glu Glu Asp Glu Leu Glu Glu Glu Glu Asp Leu
1125 1130 1135

Pro Thr Ala Ser Lys Leu Thr Pro Glu Asp Ser Pro Ala Leu Thr Pro
1140 1145 1150
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Pro s«: «= Ph. «, a,p s.r v.X Al. s.r 0» "» S " 

1155 1160 

s. r Pro val s« «u s.r cy. Thr Pro Pro «n «, Tyr 

1170 1175 
Ala Ser Val Ile Leu Arg Asp Tyr Lys Gln Ser Ser Ser Thr Leu
1185 1190 1195

(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(vii) IMMEDIATE SOURCE: 
(B) CLONE: ZC775 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:
GCTAGCATAA CCCCTTGGGG CCTCTAAACG GGTCT 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 43 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(vii) IMMEDIATE SOURCE:
(B) CLONE: ZC776

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:
CTCAAGACCC GTTTAGAGGC CCCAAGGGGT TATGCTAGCT GCA 
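
The two oligonucleotides listed as SEQ ID NO: 3 (ZC775) and SEQ ID NO: 4 (ZC776) appear to be complementary over the full length of the shorter strand. The following is a minimal sketch, not part of the original listing, that checks this by locating the reverse complement of ZC775 inside ZC776. The function names `reverse_complement` and `annealed_overhangs` are hypothetical helpers introduced only for this illustration; the sequence strings are copied from the listing with the spacing removed. Whether the two strands were actually annealed, and for what purpose, is not stated in this section.

```python
# Illustrative check only; helper names are assumptions, not from the source.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string containing only A/C/G/T."""
    return seq.translate(COMPLEMENT)[::-1]


def annealed_overhangs(top: str, bottom: str):
    """Find reverse_complement(top) inside bottom.

    Returns the bases of `bottom` that flank the matched region
    (left, right), or None if the strands are not complementary.
    """
    rc = reverse_complement(top)
    start = bottom.find(rc)
    if start < 0:
        return None
    return bottom[:start], bottom[start + len(rc):]


ZC775 = "GCTAGCATAACCCCTTGGGGCCTCTAAACGGGTCT"          # SEQ ID NO: 3
ZC776 = "CTCAAGACCCGTTTAGAGGCCCCAAGGGGTTATGCTAGCTGCA"  # SEQ ID NO: 4

if __name__ == "__main__":
    print(len(ZC775), len(ZC776))            # lengths stated in the listing: 35 and 43
    print(annealed_overhangs(ZC775, ZC776))  # flanking bases of ZC776
```

Under these assumptions, the complementary region covers all 35 bases of ZC775 and leaves four unpaired bases at each end of ZC776.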
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 40 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(vii) IMMEDIATE SOURCE:
(B) CLONE: ZC777 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
TGAGGGGTTT TTTGCTGAAA GGAGGAACTA TGCGGCCGCA 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: ZC778 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
AGCTTGCGGC CGCATAGTTC CTCCTTTCAG CAAAAAACCC 
(2) INFORMATION FOR SEQ ID NO:7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA

(vii) IMMEDIATE SOURCE:
(B) CLONE: ZC1751

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:
AATTCTGTGC TCTGTCAAG 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(vii) IMMEDIATE SOURCE:
(B) CLONE: ZC1752

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
GATCCTTGAC AGAGCACAG 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vii) IMMEDIATE SOURCE:
(B) CLONE: ZC2063

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
GATCCAAACT AGTAAAAGAG CT 
(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 14 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(vii) IMMEDIATE SOURCE:
(B) CLONE: ZC2064

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:
CTTTTACTAG TTTG 

(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 43 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA
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(vii) IMMEDIATE SOURCE: 
(B) CLONE: ZC2938 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:
GACAGAGCAC AGATTCACTA GTGAGCTCTT TTTTTTTTTT TTT 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(vii) IMMEDIATE SOURCE: 
(B) CLONE: ZC3015 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

TTCCATGGCA CCGTCAAGGC T 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: ZC3016 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:
AGTGATGGCA TGGACTGTGG T 
(2) INFORMATION FOR SEQ ID NO:14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(vii) IMMEDIATE SOURCE:
(B) CLONE: ZC3652

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:
ACATGCACCA TGCTCTGTGT
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA

(vii) IMMEDIATE SOURCE:
(B) CLONE: ZC3654

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:
AGTGATGGCA TGGACTGTGG T
(2) INFORMATION FOR SEQ ID NO: 16:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5236 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: SN23 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 627..3344

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:
- TAAGAATTTT ATAAATACTC TGGGAATm A1TGGTGATG CCTTTGTGTC TACAGGGCAC 60 

acgttccaga gagctctggt gtgaagtgat gggggacttg tcgctagaga agcttttcaa 
KCCCITM* CTCTGGGTCC tgctxgagag aggtctgagg ttctcaacat cagagcagag 
errceACCAA scmcAGAA tgctaagccc ccacitcica acacttagtg ctctgatcgg 
TGcctoccAA ccgagaacgg ctgcagtcct ctgacctgag accaatagct gtgtctaccc 
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GGACTCAGCG TCCAGCTCAC CGCCACTAAC GCGCCGCGCA TTGGACACCT GATCCACACA 360 

CCTTCGGGCA CCAGTGAAAA ACCGCGACTT GATTTTCTGG AAGAACGCCC CCAGGGTGTG 420

GGAGCGGTCG TGGAGGACCA GCAGGAGGAA GCGGAGGGGA GAGGGGCAGT AGTGGAGGCA 480

GAGAAAGCGT TGAACCAGCT GTGTTGGCCG AAGGCACGAA ACGGCAAAAG GCAGCGGTGA 540

GCATCTGTGT GGTTCCCGCT GGGAACCTGC AGGCAGGACC GGCGTGGGAA CGTGGCTGGC 600

CCGCGGTGGA CCGCGTCTTC GCCACA ATG GTC CGG CTC CTC TTG ATT TTC TTC 653
                             Met Val Arg Leu Leu Leu Ile Phe Phe
                             1 5

CCA ATG ATC TTT TTG GAG ATG TCC ATT TTG CCC AGG ATG CCT GAC AGA
Pro Met Ile Phe Leu Glu Met Ser Ile Leu Pro Arg Met Pro Asp Arg
10 15 20 25

AAA GTA TTG CTG GCA GGT GCC TCG TCC CAG CGC TCC GTG GCG AGA ATG
Lys Val Leu Leu Ala Gly Ala Ser Ser Gln Arg Ser Val Ala Arg Met
30 35 40

GAC GGA GAT GTC ATC ATC GGA GCC CTC TTC TCA GTC CAT CAC CAG CCT
Asp Gly Asp Val Ile Ile Gly Ala Leu Phe Ser Val His His Gln Pro
45 50 55

CCA GCC GAG AAG GTA CCC GAA AGG AAG TGT GGG GAG ATC AGG GAA CAG
Pro Ala Glu Lys Val Pro Glu Arg Lys Cys Gly Glu Ile Arg Glu Gln
60 65 70

TAT GGT ATC CAG AGG GTG GAG GCC ATG TTC CAC ACG TTG GAT AAG ATT
Tyr Gly Ile Gln Arg Val Glu Ala Met Phe His Thr Leu Asp Lys Ile
75 80 85

AAC GCG GAC CCG GTG CTC CTG CCC AAC ATC ACT CTG GGC AGT GAG ATC
Asn Ala Asp Pro Val Leu Leu Pro Asn Ile Thr Leu Gly Ser Glu Ile
90 95 100 105

CGG GAC TCC TGC TGG CAC TCT TCA GTG GCT CTC GAA CAG AGC ATC GAA
Arg Asp Ser Cys Trp His Ser Ser Val Ala Leu Glu Gln Ser Ile Glu
110 115 120

TTC ATC AGA GAC TCC CTG ATT TCC ATC CGA GAT GAG AAG GAT GGG CTG
Phe Ile Arg Asp Ser Leu Ile Ser Ile Arg Asp Glu Lys Asp Gly Leu
125 130 135

AAC CGA TGC CTG CCT GAT GGC CAG ACC CTG CCC CCT GGC AGG ACT AAG
Asn Arg Cys Leu Pro Asp Gly Gln Thr Leu Pro Pro Gly Arg Thr Lys
140 145 150

AAG CCT ATT GCT GGA GTG ATC GGC CCT GGC TCC AGC TCT GTG GCC ATT
Lys Pro Ile Ala Gly Val Ile Gly Pro Gly Ser Ser Ser Val Ala Ile
155 160 165

CAA GTC CAG AAT CTT CTC CAG CTG TTC GAC ATC CCA CAG ATC GCC TAT
Gln Val Gln Asn Leu Leu Gln Leu Phe Asp Ile Pro Gln Ile Ala Tyr
170 175 180 185
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TCT-GCC ACA AGC ATA GAC CTG W« Jjj ^ ™ $ 

ser Ala Thr ser lie Asp. Leu Sec Asp Lys Thr urn xy y *. q 

190 193 , 

isi-ts ss as s&g's si' - - S s - 

205 21 " 

ATA GTC AAG CGT TAC AAC TGG ACC TAT GTC TCA GCA GTC CAC ACA GAA 1325
Ile Val Lys Arg Tyr Asn Trp Thr Tyr Val Ser Ala Val His Thr Glu
220 225 230

GGG AAT TAC GGC GAG AGT GGA ATG GAT GCT TTC AAA GAA CTG GCT GCC 1373
Gly Asn Tyr Gly Glu Ser Gly Met Asp Ala Phe Lys Glu Leu Ala Ala
235 240 245



«* -*fti> cic s = at= ==a « j» ;2 j-j g ST 

Sin Clu Gly Leu Cys lie Ala His ser asp .... j 2 65 

250' ■■ 255 . . : / .'• - - . ; . 

Gly Glu Lys Ser Phe Asp Arg Leu Leu » 28Q 

285 *" u • , • . 

GGC TTA «T JCC ^ ^ SS 

Gly Leu Leu Ser Ala Met Arg iurg *-. 310 

■ . " . ,: • 300 ■ 305 - 



■ .315 320 

330 -, 1 ' 335 . . 

^ ■ , lim r » r TAG TTC CTG AAG CTG AGG CTG 

- 350 ■ JM ' 

.365. i 

— ere TTG GAA AAC CCC AAC TTT AAG 

TGT CCC CTA CCT GGA CAC .^C JTG JAA AA to „ phe Lys 

Cys Arg Leu Pro Gly His Leu Leu ew* 
380 385 

395 



GAA GGC 
Glu Gly 



TCT CCA 
Ser Pro 
345 

GAC ACC 
Asp Thr 
360 

TTC CAG 
Phe Gin 



AAA GTG 
lys Val 

AGC AAA 
ser Lys 

CTG CAG 



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Met- Gly Phe Val lie Asn Ala lie Tyr Ala Met Ala His Gly Leu Gin 

410 415 420 425

AAC ATG CAC CAT GCT CTG TGT CCC GGC CAT GTG GGC CTG TGT GAT GCT 1949
Asn Met His His Ala Leu Cys Pro Gly His Val Gly Leu Cys Asp Ala
430 435 440

ATG AAA CCC ATT GAT GGC AGG AAG CTC CTG GAT TTC CTC ATC AAA TCC 1997
Met Lys Pro Ile Asp Gly Arg Lys Leu Leu Asp Phe Leu Ile Lys Ser
445 450 455

TCT TTT GTC GGA GTG TCT GGA GAG GAG GTG TGG TTC GAT GAG AAG GGG 2045
Ser Phe Val Gly Val Ser Gly Glu Glu Val Trp Phe Asp Glu Lys Gly
460 465 470

GAT GCT CCC GGA AGG TAT GAC ATT ATG AAT CTG CAG TAC ACA GAA GCT 2093
Asp Ala Pro Gly Arg Tyr Asp Ile Met Asn Leu Gln Tyr Thr Glu Ala
475 480 485

AAT CGC TAT GAC TAT GTC CAC GTG GGG ACC TGG CAT GAA GGA GTG CTG 2141
Asn Arg Tyr Asp Tyr Val His Val Gly Thr Trp His Glu Gly Val Leu
490 495 500 505

AAT ATT GAT GAT TAC AAA ATC CAG ATG AAC AAA AGC GGA ATG GTA CGJ 2189
Asn Ile Asp Asp Tyr Lys Ile Gln Met Asn Lys Ser Gly Met Val Arg
510 515 520

TCT GTG TGC AGT GAG CCT TGC TTA AAG GGT CAG ATT AAG GTC ATA CGG 2237
Ser Val Cys Ser Glu Pro Cys Leu Lys Gly Gln Ile Lys Val Ile Arg
525 530 535

AAA GGA GAA GTG AGC TGC TGC TGG ATC TGC ACG GCC TGC AAA GAG AAT 2285
Lys Gly Glu Val Ser Cys Cys Trp Ile Cys Thr Ala Cys Lys Glu Asn
540 545 550

GAG TTT GTG CAG GAC GAG TTC ACC TGC AGA GCC TGT GAC CTG GGG TGG
Glu Phe Val Gln Asp Glu Phe Thr Cys Arg Ala Cys Asp Leu Gly Trp
555 560 565

TGG CCC AAC GCA GAG CTC ACA GGC TGT GAG CCC ATT CCT GTC CGT TAT 2381
Trp Pro Asn Ala Glu Leu Thr Gly Cys Glu Pro Ile Pro Val Arg Tyr
570 575 580 585

CTT GAG TGG AGT GAC ATA GAA TCT ATC ATA GCC ATC GCC TTT TCT TGC 2429
Leu Glu Trp Ser Asp Ile Glu Ser Ile Ile Ala Ile Ala Phe Ser Cys
590 595 600

CTG GGC ATC CTC GTG ACG CTG TTT GTC ACC CTC ATC TTC GTT CTG TAC 2477
Leu Gly Ile Leu Val Thr Leu Phe Val Thr Leu Ile Phe Val Leu Tyr
605 610 615

CGG GAC ACA CCC GTG GTC AAA TCC TCC AGT AGG GAG CTC TGC TAT ATC 2525
Arg Asp Thr Pro Val Val Lys Ser Ser Ser Arg Glu Leu Cys Tyr Ile
620 625 630

ATT CTG GCT GGT ATT TTC CTC GGC TAT GTG TGC CCT TTC ACC CTC ATC 2573
Ile Leu Ala Gly Ile Phe Leu Gly Tyr Val Cys Pro Phe Thr Leu Ile
635 640 645



GCC AAA OCX ACT ACC ACA TCC TOO TAC CXC gO «C CTC CTA CTT GGC 

Ala Lys Pro Thr Thr Thr ser cys iyr wu « ' 665 
650 655 

»^ m,^ m* r T rr GCT TTA GTG ACC AAA ACC AAT CGT 

2S S SS g SS SS 2S S ^ ». «. « 

670 675 



s sss's s s is g s g ffi - i 5 JS 

685 690 



g £ ssis sssas ss s pjs s 2" 

700 705 

S t£ SS 2S' £ £ S 5S ?S S 2 S ■ S g SS S 

715 720 

g IS? SSS 5.SS s i?s £ - = = is 



gggggggsssss"*:* 

J .750 755 

g SB'S g-g ggSjgSS gg g S- 

765 770 

sassassjS5?sssss 

780 785 

gssass ss.g's 2 s g a s. sa 

795 ■-£*■ 800 

SS5SSSSSSSS5SSSS 



810 815 



GCC CTG GGG TGC ATG WgggSES S S2 'S S £ 
Ala Leu Gly Cys Met R» Thr Pro Lys Het Tyr ne xj. mq 

830 933 

SS'SSSSS225SSSSSS 

845 850 



860 865 



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»»/- ITT TTC CGG AGAAAG AAG CCC GGG GCA GGG AAT GCC AAG AAG AGG 

875 880 885 

a a si s.s a a a as 5 a = a si a g... 

890 895 900 

CTT TGAAAACCCC CACACTGCAG TGAATGTTTC TAACGGCAAG TCTGTGTCAT 3394
Leu

GGTCTGAACC AGGTGGAAGA CAGGCGCCCA AGGGACAGCA CGTGTGGCAG CGCCTCTCTG 3454
TGCACGTGAA GACCAACGAG ACGGCCTGTA ACCAAACAGC CGTAATCAAA CCCCTCACTA 3514
AAAGTTACCA AGGCTCTGGC AAGAGCCTGA CCTTTTCAGA TGCCAGCACC AAGACCCTTT 3574
ACAATGTGGA AGAAGAGGAC AATACCCCTT CTGCTCACTT CAGCCCTCCC AGCAGCCCTT 3634
CTATGGTGGT GCACCGACGC GGGCCACCCG TGGCCACCAC ACCACCTCTG CCACCCCATC 3694
TGACCGCAGA AGAGACCCCC CTGTTCCTGG CTGATTCCGT CATCCCCAAG GGCTTGCCTC 3754
CTCCTCTCCC GCAGCAGCAG CCACAGCAGC CGCCCCCTCA GCAGCCCCCG CAGCAGCCCA 3814
AGTCCCTGAT GGACCAGCTG CAAGGCGTAG TCACCAACTT CGGTTCGGGG ATTCCAGATT 3874
TCCATGCGGT GCTGGCAGGC CCGGGGACAC CAGGAAACAG CCTGCGCTCT CTGTACCCGC 3934
CCCCGCCTCC GCCGCAACAC CTGCAGATGC TGCCCCTGCA CCTGAGCACC TTCCAGGAGG 3994
AGTCCATCTC CCCTCCTGGG GAGGACATCG ATGATGACAG TGAGAGATTC AAGCTCCTGC 4054
AGGAGTTCGT GTACGAGCGC GAAGGGAACA CCGAAGAAGA TGAATTGGAA GAGGAGGAGG 4114
ACCTGCCCAC AGCCAGCAAG CTGACCCCTG AGGATTCTCC TGCCCTGACG CCTCCTTCTC 4174
CTTTCCGAGA TTCCGTGGCC TCTGGCAGCT CAGTGCCCAG TTCCCCCGTA TCTGAGTCGG 4234
TCCTCTGCAC CCCTCCAAAT GTAACCTACG CCTCTGTCAT TCTGAGGGAC TACAAGCAAA 4294
GCTCTTCCAC CCTGTAGTGT GTGTGTGTGT GTGGGGGCGG GGGGAGTGCG CATGGAGAAG 4354
CCAGAGATGC CAAGGAGTGT CAACCCTTCC AGAAATGTGT AGAAAGCAGG GTGAGGGATG 4414
GGGATGGAGG ACCACGGTCT GCAGGGAAGA AAAAAAAAAA TGCTGCGGCT GCCTTAAAGA 4474
AGGAGAGGGA CGATGCCAAC TGAACAGTGG TCCTGGCCAG GATTGTGACT CTTGAATTAT 4534
TCAAAAACCT TCTCTAGAAA GAAAGGGAAT TATGACAAAG CACAATTCCA TATGGTATGT 4594
AACTTTTATC GAAAAAAATA ATAAAACGTA AAAATAAAAT CAACAAAAAT AATCTCTTCT 4654
TTTGCTCAAT CGTGCATACA TATATCTGCC CACACTCCCG TGGTAAAACT AGAAGCGAAG 4714
CAGGCCCTGC GATGGTGCCA ACTGAATCCT AAGTTCATCA TCCTAGTGAG CAGATGGAGA 4774
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GASGGCAGGA GGCGAGAGGG CAGGAGGCGG GGGTAGGTTC GGACAACAGC TCCCATCTCA 
GACCTTGACT GTGCTGAGTC TTCAGACTCC TGGACTAAGG AAGACCCGGG GACTGACCTT 
ATGAGGGTCC CTTTCCACTG CTGTGATCCA TTGCCAGCCT GTAGTCACCC GGGATAAAGG 
CACAGTAACC TTTTGCATTC CTGTGATTCC CTGTGTTTAA GGAAAAGGAA AGTATGAGCA 
AAGCTATCAC CAAAAAGAGC GCCATTAGAA GTTACGGGGG AGAAAAAAAG AGAAGCAAGA 
TGATATATAA GCACAGGGCC TTGAACAAGG TGAGCGTGCT TCACAGATTC CGTATTAATG 
TACAGATACT TTTGGAGAGG AGAAAGATAA CAAGGAGTGT CAGGCCGTTT GTGAACTCAC 
TTGCACTGTG CCAACCAGGT TCTCCGCTGC CCTTCAGCAA AA 

(2) INFORMATION FOR SEQ ID NO: 17:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 906 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:

Met Val Arg Leu Leu Leu Ile Phe Phe Pro Met Ile Phe Leu Glu Met
1 5 10 15

Ser Ile Leu Pro Arg Met Pro Asp Arg Lys Val Leu Leu Ala Gly Ala
20 25 30

Ser Ser Gln Arg Ser Val Ala Arg Met Asp Gly Asp Val Ile Ile Gly
35 40 45

Ala Leu Phe Ser Val His His Gln Pro Pro Ala Glu Lys Val Pro Glu
50 55 60

Arg Lys Cys Gly Glu Ile Arg Glu Gln Tyr Gly Ile Gln Arg Val Glu
65 70 75 80

Ala Met Phe His Thr Leu Asp Lys Ile Asn Ala Asp Pro Val Leu Leu
85 90 95

Pro Asn Ile Thr Leu Gly Ser Glu Ile Arg Asp Ser Cys Trp His Ser
100 105 110

Ser Val Ala Leu Glu Gln Ser Ile Glu Phe Ile Arg Asp Ser Leu Ile
115 120 125

Ser Ile Arg Asp Glu Lys Asp Gly Leu Asn Arg Cys Leu Pro Asp Gly
130 135 140

Gln Thr Leu Pro Pro Gly Arg Thr Lys Lys Pro Ile Ala Gly Val Ile
145 150 155 160
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Gly-Pro Gly Ser Ser Ser Val Ala lie Gin Val Gin Asn Leu Leu Gin 



165 170 175

Leu Phe Asp Ile Pro Gln Ile Ala Tyr Ser Ala Thr Ser Ile Asp Leu
180 185 190

Ser Asp Lys Thr Leu Tyr Lys Tyr Phe Leu Arg Val Val Pro Ser Asp
195 200 205

Thr Leu Gln Ala Arg Ala Met Leu Asp Ile Val Lys Arg Tyr Asn Trp
210 215 220

Thr Tyr Val Ser Ala Val His Thr Glu Gly Asn Tyr Gly Glu Ser Gly
225 230 235 240

Met Asp Ala Phe Lys Glu Leu Ala Ala Gln Glu Gly Leu Cys Ile Ala
245 250 255

His Ser Asp Lys Ile Tyr Ser Asn Ala Gly Glu Lys Ser Phe Asp Arg
260 265 270

Leu Leu Arg Lys Leu Arg Glu Arg Leu Pro Lys Ala Arg Val Val Val
275 280 285

Cys Phe Cys Glu Gly Met Thr Val Arg Gly Leu Leu Ser Ala Met Arg
290 295 300

Arg Leu Gly Val Val Gly Glu Phe Ser Leu Ile Gly Ser Asp Gly Trp
305 310 315 320

Ala Asp Arg Asp Glu Val Ile Glu Gly Tyr Glu Val Glu Ala Asn Gly
325 330 335

Gly Ile Thr Ile Lys Leu Gln Ser Pro Glu Val Arg Ser Phe Asp Asp
340 345 350

Tyr Phe Leu Lys Leu Arg Leu Asp Thr Asn Thr Arg Asn Pro Trp Phe
355 360 365

Pro Glu Phe Trp Gln His Arg Phe Gln Cys Arg Leu Pro Gly His Leu
370 375 380

Leu Glu Asn Pro Asn Phe Lys Lys Val Cys Thr Gly Asn Glu Ser Leu
385 390 395 400

Glu Glu Asn Tyr Val Gln Asp Ser Lys Met Gly Phe Val Ile Asn Ala
405 410 415

Ile Tyr Ala Met Ala His Gly Leu Gln Asn Met His His Ala Leu Cys
420 425 430

Pro Gly His Val Gly Leu Cys Asp Ala Met Lys Pro Ile Asp Gly Arg
435 440 445

Lys Leu Leu Asp Phe Leu Ile Lys Ser Ser Phe Val Gly Val Ser Gly
450 455 460

Glu Glu Val Trp Phe Asp Glu Lys Gly Asp Ala Pro Gly Arg Tyr Asp
465 470 475 480

Ile Met Asn Leu Gln Tyr Thr Glu Ala Asn Arg Tyr Asp Tyr Val His
485 490 495

Val Gly Thr Trp His Glu Gly Val Leu Asn Ile Asp Asp Tyr Lys Ile
500 505 510

Gln Met Asn Lys Ser Gly Met Val Arg Ser Val Cys Ser Glu Pro Cys
515 520 525

Leu Lys Gly Gln Ile Lys Val Ile Arg Lys Gly Glu Val Ser Cys Cys
530 535 540

Trp Ile Cys Thr Ala Cys Lys Glu Asn Glu Phe Val Gln Asp Glu Phe
545 550 555 560

Thr Cys Arg Ala Cys Asp Leu Gly Trp Trp Pro Asn Ala Glu Leu Thr
565 570 575

Gly Cys Glu Pro Ile Pro Val Arg Tyr Leu Glu Trp Ser Asp Ile Glu
580 585 590

Ser Ile Ile Ala Ile Ala Phe Ser Cys Leu Gly Ile Leu Val Thr Leu
595 600 605

Phe Val Thr Leu Ile Phe Val Leu Tyr Arg Asp Thr Pro Val Val Lys
610 615 620

Ser Ser Ser Arg Glu Leu Cys Tyr Ile Ile Leu Ala Gly Ile Phe Leu
625 630 635 640

Gly Tyr Val Cys Pro Phe Thr Leu Ile Ala Lys Pro Thr Thr Thr Ser
645 650 655

Cys Tyr Leu Gln Arg Leu Leu Val Gly Leu Ser Ser Ala Met Cys Tyr
660 665 670

Ser Ala Leu Val Thr Lys Thr Asn Arg Ile Ala Arg Ile Leu Ala Gly
675 680 685

Ser Lys Lys Lys Ile Cys Thr Arg Lys Pro Arg Phe Met Ser Ala Trp
690 695 700

Ala Gln Val Ile Ile Ala Ser Ile Leu Ile Ser Val Gln Leu Thr Leu
705 710 715 720

Val Val Thr Leu Ile Ile Met Glu Pro Pro Met Pro Ile Leu Ser Tyr
725 730 735

Pro Ser Ile Lys Glu Val Tyr Leu Ile Cys Asn Thr Ser Asn Leu Gly
740 745 750

Val Val Ala Pro Val Gly Tyr Asn Gly Leu Leu Ile Met Ser Cys Thr
755 760 765
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Tv- Tyr Ala Phe Lys Thr Arg Asn Val Pro Ala Asn Phe Asn Glu Ala 
770 775 780 

Lys Tyr Ile Ala Phe Thr Met Tyr Thr Thr Cys Ile Ile Trp Leu Ala
785 790 795 800

Phe Val Pro Ile Tyr Phe Gly Ser Asn Tyr Lys Ile Ile Thr Thr Cys
805 810 815

Phe Ala Val Ser Leu Ser Val Thr Val Ala Leu Gly Cys Met Phe Thr
820 825 830

Pro Lys Met Tyr Ile Ile Ile Ala Lys Pro Glu Arg Asn Val Arg Ser
835 840 845

Ala Phe Thr Thr Ser Asp Val Val Arg Met His Val Gly Asp Gly Lys
850 855 860

Leu Pro Cys Arg Ser Asn Thr Phe Leu Asn Ile Phe Arg Arg Lys Lys
865 870 875 880

Pro Gly Ala Gly Asn Ala Lys Lys Arg Gln Pro Glu Phe Ser Pro Ser
885 890 895

Ser Gln Cys Pro Ser Ala His Ala Gln Leu
900 905
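
The coding region given for SEQ ID NO: 16 (LOCATION: 627..3344) can be checked against the 906 amino acid length stated for SEQ ID NO: 17. The sketch below is not part of the original listing. It assumes the location is a 1-based, inclusive nucleotide range that excludes the stop codon, and the function names are hypothetical helpers chosen for this illustration.

```python
# Illustrative arithmetic only; function names are assumptions, not from the source.

def cds_codon_count(start: int, end: int) -> int:
    """Number of complete codons in a 1-based, inclusive CDS range."""
    length = end - start + 1
    if length <= 0 or length % 3 != 0:
        raise ValueError(f"range {start}..{end} is not a whole number of codons")
    return length // 3


def last_base_of_residue(cds_start: int, residue: int) -> int:
    """Nucleotide position of the third base of the codon for a given residue."""
    return cds_start + 3 * residue - 1


if __name__ == "__main__":
    # SEQ ID NO: 16, CDS 627..3344, compared with SEQ ID NO: 17 (906 amino acids)
    print(cds_codon_count(627, 3344))
    # Right-margin position of the row ending at residue 9 (Phe) in SEQ ID NO: 16
    print(last_base_of_residue(627, 9))
```

The same relation gives the right-margin nucleotide number of any translated row from the number of its last residue, which is useful when matching position numbers to rows in the listing.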

(2) INFORMATION FOR SEQ ID NO:18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4095 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA

(vii) IMMEDIATE SOURCE: 
(B) CLONE: SN30 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 463..3198

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:
CCCGGGCTCC CGGCAGTGCG AGCAGCTAAG GGCTGGCCGC CGCCTCCCTG AGCTCCCCCG 
GAGCAGCCGA CCCCTGGTCG CGGCGTTCAC CTCGCCGATG CGCGGTTGGT AGGAGTGACC 
GGAGCCATTC TCTCCTCGTT GATAAGATTC CCTACCAGGA TAGGAGCCTA TCTCCCTTTC 
ACAGCAGGAC ACAGAAATCT GGCCTTCAGT ACTTTGGGAA AAGGATCTGA GACCTCCTGG 
AGCTCTGACC ACTGGCTGTC ATCTGTGGCT CTGGCCTGTG TGGGCCACTG AGCTCTACTC 
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uraui G0AATC ° 0CC MCCCGTTGS PCIGCTGaT 

TACTGAACCT «« CACGTGCCCA GATCATGGGA CCCAGGGCCT GCTAGGGCTA 
GGAGCGGGGC CCAGTATTCA TGGGTCTCTA GGCCTTTCCG AA ATG TCC GGG AAG 

l 

.S S SS S 3' S - S = s SI s 

CAC CCC CAC ATG AAC TCT ATC £A ATT GAC GGG GAC ATC ACA CTG «£ 
Sis Tro His Met Asn Ser lie Arg lie Asp oiy « p gQ 
* 40 43 

GGC CTG TTT CCC GTC CAC GGC CCT GGC TCT GAG GGT AAG « « GGG 

Gly Leu Phe Pro Val His Gly Arg e-ty a . 65 

GAG CTG l AAG GAG AAA GGC ATC CAC CGC CTG GA* GCC ATG .CTG TTT 
Glu Leu Lys Lys Glu Lys Gly He His Arg i* ^ 
70 75 

r\r pta CTG CCC AAC ATC ACG 
GCC CTG GAC CGC ATC AAC AAT GAC CCG GAC CTA CTG ^ 
Ala Leu Asp Arg He Asn Asn Asp pro asp gg 100 

M S is'S SS SS «. 35 S5 S SS SS SS 

105 

a s ss % s a si. sss ss as » S ss sr 

120 125 
GAG GTC CGC TOaGGO AGG CGG GGC CCG CCC ATC ATC ACC AAG CCC GAA 
Glu val Arg Cys Gly Arg Arg Giy 145 
135 140 

SSfSfSSSSfSSSSSSSSSSS! 

150 15 

SS aS SS S SS % X SS S S | - S SS SS SJ 

165 170 

SS S 2 SS § ss ss is SS S Sll ss SS SS S SS 

^ tca GAC ACA TAG CAG GCC CAG GCC ATG GTG GAT ATT 
CGG GTG GTG CCC TCA GAC ACA iaw w» 



360 
420 
474 

522 

570 

618 
666 
714 
762 
810 
858 
906 
954 
1002 
1050 
1098 
Arg Val Val Pro Ser Asp Thr Tyr Gln Ala Gln Ala Met Val Asp Ile



200 



GTC CGA GCC CTC AAG TGG AAC TAT GTG TCC ACA CTG GCC TCA GAG GGC
Val Arg Ala Leu Lys Trp Asn Tyr Val Ser Thr Leu Ala Ser Glu Gly
        215                 220                 225

AGC TAC GGT GAG AGT GGT GTG GAG GCC TTT ATC CAG AAG TCC CGA GAG
Ser Tyr Gly Glu Ser Gly Val Glu Ala Phe Ile Gln Lys Ser Arg Glu
    230                 235                 240

AAC GGA GGT GTG TGC ATT GCC CAG TCG GTG AAG ATT CCA CGG GAA CCC
Asn Gly Gly Val Cys Ile Ala Gln Ser Val Lys Ile Pro Arg Glu Pro
245                 250                 255                 260

AAG ACG GGG GAG TTC GAC AAG ATC ATC AAA CGC CTA CTG GAA ACA TCC
Lys Thr Gly Glu Phe Asp Lys Ile Ile Lys Arg Leu Leu Glu Thr Ser
                265                 270                 275

AAT GCC AGG GGT ATC ATC ATC TTT GCC AAC GAG GAT GAC ATC AGG AGG
Asn Ala Arg Gly Ile Ile Ile Phe Ala Asn Glu Asp Asp Ile Arg Arg
            280                 285                 290

GTG TTG GAG GCA GCT CGC AGG GCC AAC CAG ACC GGC CAC TTC TTT TGG
Val Leu Glu Ala Ala Arg Arg Ala Asn Gln Thr Gly His Phe Phe Trp
        295                 300                 305

ATG GGT TCT GAT AGC TGG GGC TCC AAG AGT GCC CCT GTG CTG CGC CTT
Met Gly Ser Asp Ser Trp Gly Ser Lys Ser Ala Pro Val Leu Arg Leu
    310                 315                 320

GAG GAG GTG GCC GAG GGC GCA GTC ACC ATT CTC CCC AAG AGG ATG TCT
Glu Glu Val Ala Glu Gly Ala Val Thr Ile Leu Pro Lys Arg Met Ser
325                 330                 335                 340

GTT CGA GGG TTC GAC CGA TAC TTC TCC AGC CGC ACG CTG GAC AAC AAC
Val Arg Gly Phe Asp Arg Tyr Phe Ser Ser Arg Thr Leu Asp Asn Asn
                345                 350                 355

AGG CGC AAC ATC TGG TTT GCC GAG TTC TGG GAG GAC AAC TTC CAT TGC
Arg Arg Asn Ile Trp Phe Ala Glu Phe Trp Glu Asp Asn Phe His Cys
            360                 365                 370

AAG TTG AGC CGC CAC GCG CTC AAG AAG GGA AGC CAC ATC AAG AAG TGC
Lys Leu Ser Arg His Ala Leu Lys Lys Gly Ser His Ile Lys Lys Cys
        375                 380                 385

S i£ 25 3K SS £ 5 35 2S S - - - - « 

.390 395 * uu 

$ a a a a sjs s a T £ a s a a a a 

sss a'a a - a a a a a a aa-sr 

435 



rAr rma cTT AAG TAG ATC AGG AAC 

CGC ATG GAC CCC GTG GAT GGC ACC CAG CTG CTT A^ ^ ^ ^ ^ 

Arg Met Asp Pro Val Asp Gly Tnr jxn hbu j 450 

' . „„„ rr ~ GTA xCC TTC AAT GAG AAC 

ijs 's s sg s-s as s ?s ^ - - - - 

455 460 

» 5 a » s as? s I - = - - 

470 475 
CCC TOG GCC GAG TAC AAG GTC ATC GGC TCG TGG ACA GAC CAC CTG CAC 
Gly Ser Ala Glu.Tyr Lys Val He Gly ser J|P * 500 

485 490 

crc aoa at* «ac «t x« ™ gf.g^SgS!2!-SS 

Leu Arg IU Glu Arg Met Gin Trp Pro Gly ser - y gi5 

505 51 ? 

520 525 



535 540 

SS.S SI'S & 5. 5 5 51» S i .« - « = 



550 555 



CGG CCC ACA C« »C C«=C A« ACC T=C CJO CCC ATC CCC ATC «C A* 
Arg Pro Thr Glu Asn Arg Thr Ser Cys Gin *ro 58 o 



565 570 



TTG GAG TGG GAC TCG CCG TGG GCC GTG CTG gC CTC TTC CTG GCC GTG 
Leu Glu Trp Asp ser Pro Trp Ala val Leu pro 595 

J*. 585 

sss is ». = s § s »s s g as 
- s s sss s ssrs s. s' a s=« & 



615 620 



2 2 SS S e ' S 3 ^ .SS'SS'S S? = S X. 

630 635 
CTC GGC ATG AGC ATC AGC TAC GCG GCC CTG CTG ACC AAG ACC AAC CGC   2490
Leu Gly Met Ser Ile Ser Tyr Ala Ala Leu Leu Thr Lys Thr Asn Arg
                665                 670                 675

ATT TAC CGC ATC TTT GAG CAG GGC AAA CGG TCG GTC AGT GCC CCG CGT   2538
Ile Tyr Arg Ile Phe Glu Gln Gly Lys Arg Ser Val Ser Ala Pro Arg
            680                 685                 690

TTC ATC AGC CCG GCC TCG CAG CTG GCC ATC ACC TTC ATC CTC ATC TCC   2586
Phe Ile Ser Pro Ala Ser Gln Leu Ala Ile Thr Phe Ile Leu Ile Ser
        695                 700                 705

CTG CAG CTG CTC GGC ATC TGC GTG TGG TTC GTG GTG GAC CCC TCC CAC   2634
Leu Gln Leu Leu Gly Ile Cys Val Trp Phe Val Val Asp Pro Ser His
    710                 715                 720

TCG GTG GTG GAC TTC CAG GAC CAA CGG ACA CTT GAC CCC CGC TTT GCC   2682
Ser Val Val Asp Phe Gln Asp Gln Arg Thr Leu Asp Pro Arg Phe Ala
725                 730                 735                 740

AGG GGC GTG CTC AAG TGC GAC ATC TCG GAC CTG TCC CTC ATC TGC CTG   2730
Arg Gly Val Leu Lys Cys Asp Ile Ser Asp Leu Ser Leu Ile Cys Leu
                745                 750                 755

CTG GGC TAC AGC ATG CTG CTG ATG GTC ACG TGT ACT GTG TAC GCC ATC   2778
Leu Gly Tyr Ser Met Leu Leu Met Val Thr Cys Thr Val Tyr Ala Ile
            760                 765                 770

AAG ACC CGA GGC GTG CCC GAG ACC TTC AAC GAG GCC AAG CCC ATC GGC   2826
Lys Thr Arg Gly Val Pro Glu Thr Phe Asn Glu Ala Lys Pro Ile Gly
        775                 780                 785

TTC ACC ATG TAC ACC ACC TGC ATT GTC TGG CTG GCC TTC ATC CCC ATC   2874
Phe Thr Met Tyr Thr Thr Cys Ile Val Trp Leu Ala Phe Ile Pro Ile
    790                 795                 800

TTT TTT GGC ACC TCA CAG TCA GCC GAC AAG CTG TAC ATC CAG ACA ACC   2922
Phe Phe Gly Thr Ser Gln Ser Ala Asp Lys Leu Tyr Ile Gln Thr Thr
805                 810                 815                 820

ACA CTG ACG GTC TCC GTG AGT CTG AGC GCT TCA GTG TCC CTG GGG ATG   2970
Thr Leu Thr Val Ser Val Ser Leu Ser Ala Ser Val Ser Leu Gly Met
                825                 830                 835

CTC TAC ATG CCC AAA GTC TAC ATC ATC CTC TTC CAC CCG GAG CAG AAC   3018
Leu Tyr Met Pro Lys Val Tyr Ile Ile Leu Phe His Pro Glu Gln Asn
            840                 845                 850

GTC CCC AAG CGC AAG CGC AGT CTC AAA GCC GTG GTC ACC GCC GCC ACC   3066
Val Pro Lys Arg Lys Arg Ser Leu Lys Ala Val Val Thr Ala Ala Thr
        855                 860                 865

ATG TCC AAC AAG TTC ACA CAG AAG GGC AAC TTC AGG CCC AAT GGG GAA   3114
Met Ser Asn Lys Phe Thr Gln Lys Gly Asn Phe Arg Pro Asn Gly Glu
    870                 875                 880


GCC AAA TCA GAG CTG TGT GAG AAC CTG GAG ACC CCA GCG CTG GCT ACC   3162
Ala Lys Ser Glu Leu Cys Glu Asn Leu Glu Thr Pro Ala Leu Ala Thr
885                 890                 895                 900

AAA CAG ACC TAC GTC ACC TAC ACC AAC CAT GCC ATC TAGCCGGGCC        3208
Lys Gln Thr Tyr Val Thr Tyr Thr Asn His Ala Ile
                905                 910

GCGGAGCCAA GCAGGCTAAG GAGCCACAAC CTCTGAGGAT GGCACATTGG GCCAGGGCCG   3268
TTCCCGAGGG CCCTGCCGAT GTCTGCCCGC CTCCCGGGCA TCCACGAATG TGGCTTGGTG   3328
CTGAGGACAG TAGAGACCCC GGCCATCACT GCTGGGCAAG CCGTGGTGGG CAACCAGAGG   3388
AGGCCGAGTG GCTGGGGCAG TTCCAGGTTA TGCCACACAC AGGTCTTCCT TCTGGACCAC   3448
TGTTGGCCCA GCCCCAAAGC ACAGGGGCTC GGTCTCCAGA GCCCAGCCCT GGCTTCCTCT   3508
CpXTCCTCCT GCCTCCGTCT GTCCTGTGGG TGACCCCGGT TGGTCCCTGC CCCGTCTTTA   3568
CGTTTCTCTT CCGTCTTTGC TCTGCATGTG TTGTCTGTTT GGGCCCTCTG CTTCCATATT   3628
TTTCCATTCT GCTCCTGGCC TTCCCCTGCC ATCTGCCCTG CCCCCTGCCC CTCCTCCCTG   3688
AGCTGCCCCA TCCCCGCCAT CATTTTCTCT TCTGTTCCCC CTCGATCTCA TTTCCTACCA   3748
GCCTTCCCCC TACTTGGCTT CATCCACCAA CTCTTTCACC ACGTTGCAAA AGAGAAAAAA   3808
AAAGGGGGGG GGGAATCACC CCCTACAAAA AAGCCCAAAC AAAAACTAAT CTTGAGTGTG   3868
TTTCGAAGTG CTGCGTCCTC CTGGTGGCCT GTGTGTCCCT GTGCCTGCAG CCTGTCTGCC   3928
CGCCCTACCC GTCTGCCGTG TGTCCTGCCC CCCCCGCCTG CCCGCCTTGC CCTTCCTGCT   3988
AACGACACGG AGTTCAGTGC CTGGGTGTTT GGTGATGGTC TCTGATGTGT AGCATGTCTG   4048
TTTTTATACC GAGAACATTT CTAATAAAGA TAAACACATG GTTTTGC                 4095

(2) INFORMATION FOR SEQ ID NO: 19:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 912 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:19:

Met Ser Gly Lys Gly Gly Trp Ala ^ ^^^^..^ fU ^ -

Ile Thr Leu Gly Gly Leu Phe Pro Val His Gly Arg Gly Ser Glu Gly
    50                  55                  60

Lys Ala Cys Gly Glu Leu Lys Lys Glu Lys Gly Ile His Arg Leu Glu
65                  70                  75                  80

Ala Met Leu Phe Ala Leu Asp Arg Ile Asn Asn Asp Pro Asp Leu Leu
                85                  90                  95

Pro Asn Ile Thr Leu Gly Ala Arg Ile Leu Asp Thr Cys Ser Arg Asp
            100                 105                 110

Thr His Ala Leu Glu Gln Ser Leu Thr Phe Val Arg Ala Leu Ile Glu
        115                 120                 125

Lys Asp Gly Thr Glu Val Arg Cys Gly Arg Arg Gly Pro Pro Ile Ile
    130                 135                 140

Thr Lys Pro Glu Arg Val Val Gly Val Ile Gly Ala Ser Gly Ser Ser
145                 150                 155                 160

Val Ser Ile Met Val Ala Asn Ile Leu Arg Leu Phe Lys Ile Pro Gln
                165                 170                 175

Ile Ser Tyr Ala Ser Thr Ala Pro Asp Leu Ser Asp Asn Ser Arg Tyr
            180                 185                 190

Asp Phe Phe Ser Arg Val Val Pro Ser Asp Thr Tyr Gln Ala Gln Ala
        195                 200                 205

Met Val Asp Ile Val Arg Ala Leu Lys Trp Asn Tyr Val Ser Thr Leu
    210                 215                 220

Ala Ser Glu Gly Ser Tyr Gly Glu Ser Gly Val Glu Ala Phe Ile Gln
225                 230                 235                 240

Lys Ser Arg Glu Asn Gly Gly Val Cys Ile Ala Gln Ser Val Lys Ile
                245                 250                 255

Pro Arg Glu Pro Lys Thr Gly Glu Phe Asp Lys Ile Ile Lys Arg Leu
            260                 265                 270

Leu Glu Thr Ser Asn Ala Arg Gly Ile Ile Ile Phe Ala Asn Glu Asp
        275                 280                 285

Asp Ile Arg Arg Val Leu Glu Ala Ala Arg Arg Ala Asn Gln Thr Gly
    290                 295                 300

His Phe Phe Trp Met Gly Ser Asp Ser Trp Gly Ser Lys Ser Ala Pro
305                 310                 315                 320

Val Leu Arg Leu Glu Glu Val Ala Glu Gly Ala Val Thr Ile Leu Pro
                325                 330                 335

Lys Arg Met Ser Val Arg Gly Phe Asp Arg Tyr Phe Ser Ser Arg Thr
            340                 345                 350

Leu Asp Asn Asn Arg Arg Asn Ile Trp Phe Ala Glu Phe Trp Glu Asp
        355                 360                 365

Asn Phe His Cys Lys Leu Ser Arg His Ala Leu Lys Lys Gly Ser His
    370                 375                 380

Ile Lys Lys Cys Thr Asn Arg Glu Arg Ile Gly Gln Asp Ser Ala Tyr
385                 390                 395                 400

Glu Gln Glu Gly Lys Val Gln Phe Val Ile Asp Ala Val Tyr Ala Met
                405                 410                 415

Gly His Ala Leu His Ala Met His Arg Asp Leu Cys Pro Gly Arg Val
            420                 425                 430

Gly Leu Cys Pro Arg Met Asp Pro Val Asp Gly Thr Gln Leu Leu Lys
        435                 440                 445

Tyr Ile Arg Asn Val Asn Phe Ser Gly Ile Ala Gly Asn Pro Val Thr
    450                 455                 460

Phe Asn Glu Asn Gly Asp Ala Pro Gly Arg Tyr Asp Ile Tyr Gln Tyr
465                 470                 475                 480

Gln Leu Arg Asn Gly Ser Ala Glu Tyr Lys Val Ile Gly Ser Trp Thr
                485                 490                 495

Asp His Leu His Leu Arg Ile Glu Arg Met Gln Trp Pro Gly Ser Gly
            500                 505                 510

Gln Gln Leu Pro Arg Ser Ile Cys Ser Leu Pro Cys Gln Pro Gly Glu
        515                 520                 525

Arg Lys Lys Thr Val Lys Gly Met Ala Cys Cys Trp His Cys Glu Pro
    530                 535                 540

Cys Thr Gly Tyr Gln Tyr Gln Val Asp Arg Tyr Thr Cys Lys Thr Cys
545                 550                 555                 560

Pro Tyr Asp Met Arg Pro Thr Glu Asn Arg Thr Ser Cys Gln Pro Ile
                565                 570                 575

Pro Ile Val Lys Leu Glu Trp Asp Ser Pro Trp Ala Val Leu Pro Leu
            580                 585                 590

Phe Leu Ala Val Val Gly Ile Ala Ala Thr Leu Phe Val Val Val Thr
        595                 600                 605

Phe Val Arg Tyr Asn Asp Thr Pro Ile Val Lys Ala Ser Gly Arg Glu
    610                 615                 620

Leu Ser Tyr Val Leu Leu Ala Gly Ile Phe Leu Cys Tyr Ala Thr Thr
625                 630                 635                 640

Phe Leu Met Ile Ala Glu Pro Asp Leu Gly Thr Cys Ser Leu Arg Arg
                645                 650                 655

Ile Phe Leu Gly Leu Gly Met Ser Ile Ser Tyr Ala Ala Leu Leu Thr
            660                 665                 670

Lys Thr Asn Arg Ile Tyr Arg Ile Phe Glu Gln Gly Lys Arg Ser Val
        675                 680                 685

Ser Ala Pro Arg Phe Ile Ser Pro Ala Ser Gln Leu Ala Ile Thr Phe
    690                 695                 700

Ile Leu Ile Ser Leu Gln Leu Leu Gly Ile Cys Val Trp Phe Val Val
705                 710                 715                 720

Asp Pro Ser His Ser Val Val Asp Phe Gln Asp Gln Arg Thr Leu Asp
                725                 730                 735

Pro Arg Phe Ala Arg Gly Val Leu Lys Cys Asp Ile Ser Asp Leu Ser
            740                 745                 750

Leu Ile Cys Leu Leu Gly Tyr Ser Met Leu Leu Met Val Thr Cys Thr
        755                 760                 765

Val Tyr Ala Ile Lys Thr Arg Gly Val Pro Glu Thr Phe Asn Glu Ala
    770                 775                 780

Lys Pro Ile Gly Phe Thr Met Tyr Thr Thr Cys Ile Val Trp Leu Ala
785                 790                 795                 800

Phe Ile Pro Ile Phe Phe Gly Thr Ser Gln Ser Ala Asp Lys Leu Tyr
                805                 810                 815

Ile Gln Thr Thr Thr Leu Thr Val Ser Val Ser Leu Ser Ala Ser Val
            820                 825                 830

Ser Leu Gly Met Leu Tyr Met Pro Lys Val Tyr Ile Ile Leu Phe His
        835                 840                 845

Pro Glu Gln Asn Val Pro Lys Arg Lys Arg Ser Leu Lys Ala Val Val
    850                 855                 860

Thr Ala Ala Thr Met Ser Asn Lys Phe Thr Gln Lys Gly Asn Phe Arg
865                 870                 875                 880

Pro Asn Gly Glu Ala Lys Ser Glu Leu Cys Glu Asn Leu Glu Thr Pro
                885                 890                 895

Ala Leu Ala Thr Lys Gln Thr Tyr Val Thr Tyr Thr Asn His Ala Ile
            900                 905                 910



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2426 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA

(vii) IMMEDIATE SOURCE: 
(B) CLONE: SR13 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:
CCCAACATCA CGTTGGGCGC CCGCATTCTG GACACCTGCT CGAGGGACAC CCACGCCCTG
GAGCAGTCAC TGACCTTTGT GCGGGCGCTC ATCGAGAAGG ACGGCACGGA GGTCCGCTGC 
GGCAGGCGGG GCCCGCCCAT CATCACCAAG CCCGAACGAG TGGTGGGTGT CATTGGAGCT 
TCGGGGAGCT CCGTCTCGAT CATGGTGGCC AACATCCTCC GCCTCTTCAA GATCCCTCAG 
ATCACCTATG CCTCCACGGC CCCTGACTTG AGTGACAACA GCCGCTATGA CTTCTTCTCC
CGGGTGGTGC CCTCAGACAC ATACCAGGCC CAGGCCATGG TGGATATTGT CCGAGCCCTC 
AAGTGGAACT ATGTGTCCAC ACTGGCCTCA GAGGGCAGCT ACGGTGAGAG TGGTGTGGAG
GCCTTTATCC AGAAGTCCCG AGAGAACGGA GGTGTGTGCA TTGCCCAGTC GGTGAAGATT 
CCACGGGAAC CCAAGACGGG GGAGTTCGAC AAGATCATCA AACGCCTACT GGAAACATCC 
AATGCCAGGG GTATCATCAT CTTTGCCAAC GAGGATGACA TCAGGAGGGT GTTGGAGGCA
GCTCGCAGGG CCAACCAGAC CGGCCACTTC TTTTGGATGG GTTCTGATAG CTGGGGCTCC 
AAGAGTGCCC CTGTGCTGCG CCTTGAGGAG GTGGCCGAGG GCGCAGTCAC CATTCTCCCC 
AAGAGGATGT CTGTTCGAGG GTTCGACCGA TACTTCTCCA GCCGCACGCT GGACAACAAC 
AGGCGCAACA TCTGGTTTGC CGAGTTCTGG GAGGACAACT TCCATTGCAA GTTGAGCCGC 
CACGCGCTCA AGAAGGGAAG CCACATCAAG AAGTGCACCA ACCGAGAGCG CATCGGGCAG 
CACTCGGCCT ATGAGCAGGA GGGGAAGGTG CAGTTCGTGA TTGACGCTGT GTACGCCATG 
GGCCACGCGC TGCACGCCAT GCACCGTGAC CTGTGTCCCG GCCGCGTAGG ACTCTGCCCT
CGCATGGACC CCGTGGATGG CACCCAGCTG CTTAAGTACA TCAGGAACGT CAACTTCTCA 
GGCATTGCGG GGAACCCTGT AACCTTCAAT GAGAACGGAG ACGCACCGGG GCGCTACGAC 
ATCTACCAGT ACCAACTGCG CAATGGCTCG GCCGAGTACA AGGTCATCGG CTCGTGGACA
GACCACCTGC ACCTCAGAAT AGAGCGGATG CAGTGGCCAG GGAGTGGCCA GCAGCTGCCG
CGCTCCATCT GCAGTCTGCC CTGCCAGCCC GGGGAGCGAA AGAAGACTGT GAAGGGCATG 
CCTTGCTGCT GGCACTGCGA GCCCTGCACC GGGTACCAGT ACCAAGTGGA CCGCTACACC
TGTAAGACCT GCCCCTACGA CATGCGGCCC ACAGAGAACC GCACGAGCTG CCAGCCCATC
CCCATCGTCA AGTTGGAGTG GGACTCGCCG TGGGCCGTGC TGCCCCTCTT CCTGGCCGTG
GTGGGCATCG CCGCCACGCT GTTCGTGGTG GTCACGTTTG TGCGCTACAA CGATACCCCC 
ATCGTCAAGG CCTCGGGCCG GGAGCTGAGC TACGTGCTGC TGGCGGGCAT CTTTCTGTGC 
TACGCCACTA CCTTCCTCAT GATCGCAGAG CCGGACCTGG GGACCTGTTC GCTCCGCCGC 
ATCTTCCTAG GGCTCGGCAT GAGCATCAGC TACGCGGCCC TGCTGACCAA GACCAACCGC 
ATTTACCGCA TCTTTGAGCA GGGCAAACGG TCGGTCAGTG CCCCGCGTTT CATCAGCCCG 
GCCTCGCAGC TGGCCATCAC CTTCATCCTC ATCTCCCTGC AGCTGCTCGG CATCTGCGTG
TGGTTCGTGG TGGACCCCTC CCACTCGGTG GTGGACTTCC AGGACCAACG GACACTTGAC
CCCCGCTTTG CCAGGGGCGT GCTCAAGTGC GACATCTCGG ACCTGTCCCT CATCTGCCTG
CTGGGCTACA GCATGCTGCT GATGGTCACG TGTACTGTGT ACGCCATCAA GACCCGAGGC 
GTGCCCGAGA CCTTCAACGA GGCCAAGCCC ATCGGCTTCA CCATGTACAC CACCTGCATT 
GTCTGGCTGG CCTTCATCCC CATCTTTTTT GGCACCTCAC AGTCAGCCGA CAAGCTCTAC
ATCCAGACAA CCACACTGAC GGTCTCCGTG AGTCTGAGCG CTTCAGTGTC CCTGGGGATG
CTCTACATGC CCAAAGTCTA CATCATCCTC TTCCATATTT TTCCATTCTG CTCCTGGCCT
TCCCCTGCCA TCTGCCCTGC CCCCTGCCCC TCCTCCCTGA GCTGCCCCAT CCCCGCCATC
ATTTTCTCTT CTGTTCCCCC TCGATCTCAT TTCCTACCAG CCTTCCCCCT ACTTGGCTTC 
CTCCACCAAC TCTTTCACCA CGTTGC 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 17 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:

Arg Asp Ser Leu Ile Ser Ile Arg Asp Glu Lys Asp Gly Leu Asn Arg
1               5                   10                  15

Cys

(2) INFORMATION FOR SEQ ID NO:22: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 16 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:

Asp Arg Leu Leu Arg Lys Leu Arg Glu Arg Leu Pro Lys Ala Arg Val
1               5                   10                  15

(2) INFORMATION FOR SEQ ID NO:23: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 16 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23:

Glu Glu Val Trp Phe Asp Glu Lys Gly Asp Ala Pro Gly Arg Tyr Asp



(2) INFORMATION FOR SEQ ID NO:24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24:

Glu Phe Val Tyr Glu Arg Glu Gly Asn Thr Glu Glu Asp Glu Leu
1               5                   10                  15

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:

Pro Glu Arg Lys Cys Cys Glu Ile Arg Glu Gln Tyr Gly Ile Gln Arg
1               5                   10                  15

Val 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Ile Gly Pro Gly Ser Ser Ser Val Ala Ile Gln Val Gln Asn Leu Leu
1               5                   10                  15



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27:

Ile Ala Tyr Ser Ala Thr Ser Ile Asp Leu Ser Asp Lys Thr Leu
1               5                   10                  15

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:

Lys Lys Pro Gly Ala Gly Asn Ala Lys Lys Arg Gln Pro Glu Phe Ser

(2) INFORMATION FOR SEQ ID NO: 29:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29:

Pro Glu Phe Ser Pro Ser Ser Gln Cys Pro Ser Ala His Ala Gln Leu
1               5                   10                  15



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30:

Asp Lys Ile Ile Lys Arg Leu Leu Glu Thr Ser Asn Ala Arg Gly
1               5                   10                  15

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31:

Val Asn Phe Ser Gly Ile Ala Gly Asn Pro Val Thr Phe Asn Glu Asn
1               5                   10                  15

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 16 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Gly Glu Ala Lys Ser Glu Leu Cys Glu Asn Leu Glu Thr Pro Ala Leu 

(2) INFORMATION FOR SEQ ID NO:33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33:

Pro Ala Arg Leu Ala Leu Pro Ala Asn Asp Thr Glu Phe Ser Ala Trp
1               5                   10                  15

Val
WHAT IS CLAIMED IS:

1. An isolated mammalian G protein-coupled glutamate
receptor or a fragment thereof.

2. The G protein-coupled glutamate receptor of claim
1, which is substantially pure.

3. The G protein-coupled glutamate receptor of claim
1, which is human or rodent.

4. An antiserum obtained from an animal immunized
with the G protein-coupled glutamate receptor of claim 1.

5. A monoclonal antibody which specifically binds to
the G protein-coupled glutamate receptor of claim 1.

6. The G protein-coupled glutamate receptor of claim
1, which binds glutamate or quisqualate and thereby activates
phospholipase C or stimulates inositol phospholipid metabolism
in a vertebrate cell.

7. A recombinantly produced polypeptide having the
activity of a mammalian G protein-coupled glutamate receptor.

8. The polypeptide of claim 7, which has the
activity of a human or rodent mammalian G protein-coupled
glutamate receptor.

9. An isolated and purified polynucleotide molecule
which codes for a mammalian G protein-coupled glutamate
receptor or a fragment thereof.

10. The polynucleotide of claim 9, which is a
genomic DNA sequence, a cDNA sequence, or an RNA antisense
sequence.
11. The polynucleotide of claim 9, which codes for
human or rodent G protein-coupled glutamate receptor.

12. The polynucleotide of claim 9, which encodes a
polypeptide displaying mammalian G protein-coupled glutamate
receptor activity.

13. The polynucleotide of claim 9, which is
substantially the sequence of Fig. 5, Fig. 7, Fig. 8 or Fig. 9.

14. A probe which comprises an oligonucleotide
capable of specifically hybridizing with a gene which encodes a
mammalian G protein-coupled glutamate receptor or a fragment
thereof.

15. The probe of claim 14, which comprises from
about 40 to about 60 nucleotides in length.

16. The probe of claim 15, which is labeled to
provide a detectable signal.

17. A DNA construct comprising the following
operably linked elements:

a transcriptional promoter;

a DNA sequence encoding a mammalian G protein-coupled
glutamate receptor or a fragment thereof; and
a transcriptional terminator.

18. The DNA construct of claim 17, wherein the DNA
sequence encodes a human or rodent G protein-coupled glutamate
receptor polypeptide.

19. The DNA construct of claim 17, wherein the DNA
sequence encoding the mammalian G protein-coupled glutamate
receptor is substantially the sequence of Fig. 5, Fig. 7, Fig.
8 or Fig. 9.
20. A cultured eukaryotic cell transformed or
transfected with a DNA construct which comprises the following
operably linked elements:

a transcriptional promoter;
a DNA sequence encoding a mammalian G protein-coupled
glutamate receptor or a fragment thereof; and
a transcriptional terminator.

21. The eukaryotic cell of claim 20, which is a
mammalian cell.

22. The eukaryotic cell of claim 20, which does not
express endogenous G protein-coupled glutamate receptors.

23. The eukaryotic cell line of claim 20, wherein
the DNA sequence encodes a human or rodent G protein-coupled
glutamate receptor polypeptide.

24. The eukaryotic cell line of claim 21, wherein
the G protein-coupled glutamate receptor polypeptide encoded by
the DNA sequence is coupled to G protein in a mammalian cell.

25. The DNA construct of claim 20, wherein the DNA
sequence encoding the mammalian G protein-coupled glutamate
receptor is substantially the sequence of Fig. 5, Fig. 7, Fig.
8 or Fig. 9.

26. A method for producing a mammalian G protein-
coupled glutamate receptor, which comprises:

growing eukaryotic cells transformed or transfected
with a DNA construct which comprises a DNA sequence coding for
the expression of the G protein-coupled glutamate receptor, and
isolating the receptor from the cells.

27. The method of claim 26, wherein the cells are
cultured mammalian cells.
28. The method of claim 26, wherein the glutamate
receptor is human or rodent.

29. The method of claim 26, wherein the glutamate
receptor is isolated by immunoaffinity purification.

30. The method of claim 26, wherein the G protein-
coupled glutamate receptor is not coupled to protein G in the
eukaryotic cells.

31. A method for determining the presence of a
mammalian G protein-coupled glutamate receptor in a biological
sample, which comprises incubating the sample with a
monospecific antibody which specifically binds to the receptor
under conditions sufficient for immune complex formation and
determining therefrom the presence of the immune complexes.

32. The method of claim 31, wherein the monospecific
antibody is a monoclonal antibody or a purified antiserum.

33. The method of claim 32, wherein the monospecific
antibody is labeled.

34. A method for identifying a compound which alters
G protein-coupled glutamate receptor mediated-metabolism, which
comprises incubating the compound with eukaryotic cells which
express recombinant mammalian G protein-coupled glutamate
receptor and determining therefrom the effect of said compound
on receptor-mediated metabolism in the cells.

35. The method of claim 34, wherein the compound is
incubated with the receptor and ligand.

36. The method of claim 35, wherein the ligand is
glutamate or quisqualate.

37. The method of claim 34, wherein the eukaryotic
cell expresses a human or rodent G protein-coupled glutamate
receptor.

38. The method of claim 37, wherein inositol
phospholipid metabolism in the eukaryotic cell is monitored for
alteration by the compound.
CCGAGAACGG CTGCAGTCCT CTGACCTGAG ACCAATAGCT GTGTCTACCC GGACTCAGCG   60
TCCAGCTCAC CGCCACTAAC GCGCCGCGCA TTGGACACCT GATCCACACA CCTTCGGGCA   120
CCAGTGAAAA ACCGCGACTT GATTTTCTGG AAGAACGCCC CCAGGGTGTG GGAGCGGTCG   180
TGGAGGACCA GCAGGAGGAA GCGGAGGGGA GAGGGGCAGT AGTGGAGGCA GAGAAAGCGT   240
TGAACCAGCT GTGTTGGCCG AAGGCACGAA ACGGCAAAAG GCAGCGGTGA GCATCTGTGT   300
GGTTCCCGCT GGGAACCTGC AGGCAGGACC GGCGTGGGAA CGTGGCTGGC CGGCGGTGGA   360

CCGCGTCJTC GCCACA ATG GTC CGG CTC CTC TTG ATT TTC TTC CCA ATG   409
                  Met Val Arg Leu Leu Leu Ile Phe Phe Pro Met
                  1               5                   10

ATC TTT TTG GAG ATG TCC ATT TTG CCC AGG ATG CCT GAC AGA AAA GTA   457
Ile Phe Leu Glu Met Ser Ile Leu Pro Arg Met Pro Asp Arg Lys Val
            15                  20                  25

TTG CTG GCA GGT GCC TCG TCC CAG CGC TCC GTG GCG AGA ATG GAC GGA   505
Leu Leu Ala Gly Ala Ser Ser Gln Arg Ser Val Ala Arg Met Asp Gly
        30                  35                  40

GAT GTC ATC ATC GGA GCC CTC TTC TCA GTC CAT CAC CAG CCT CCA GCC   553
Asp Val Ile Ile Gly Ala Leu Phe Ser Val His His Gln Pro Pro Ala
    45                  50                  55

GAG AAG GTA CCC GAA AGG AAG TGT GGG GAG ATC AGG GAA CAG TAT GGT   601
Glu Lys Val Pro Glu Arg Lys Cys Gly Glu Ile Arg Glu Gln Tyr Gly
60                  65                  70                  75

ATC CAG AGG GTG GAG GCC ATG TTC CAC ACG TTG GAT AAG ATT AAC GCG   649
Ile Gln Arg Val Glu Ala Met Phe His Thr Leu Asp Lys Ile Asn Ala
                80                  85                  90

GAC CCG GTG CTC CTG CCC AAC ATC ACT CTG GGC AGT GAG ATC CGG GAC   697
Asp Pro Val Leu Leu Pro Asn Ile Thr Leu Gly Ser Glu Ile Arg Asp
            95                  100                 105

TCC TGC TGG CAC TCT TCA GTG GCT CTC GAA CAG AGC ATC GAA TTC ATC   745
Ser Cys Trp His Ser Ser Val Ala Leu Glu Gln Ser Ile Glu Phe Ile
        110                 115                 120

FIG. 5A
AGA GAC TCC CTG ATT TCC ATC CGA GAT GAG AAG GAT GGG CTG AAC CGA   793
Arg Asp Ser Leu Ile Ser Ile Arg Asp Glu Lys Asp Gly Leu Asn Arg
    125                 130                 135

TGC CTG CCT GAT GGC CAG ACC CTG CCC CCT GGC AGG ACT AAG AAG CCT   841
Cys Leu Pro Asp Gly Gln Thr Leu Pro Pro Gly Arg Thr Lys Lys Pro
140                 145                 150                 155

ATT GCT GGA GTG ATC GGC CCT GGC TCC AGC TCT GTG GCC ATT CAA GTC   889
Ile Ala Gly Val Ile Gly Pro Gly Ser Ser Ser Val Ala Ile Gln Val
                160                 165                 170

CAG AAT CTT CTC CAG CTG TTC GAC ATC CCA CAG ATC GCC TAT TCT GCC   937
Gln Asn Leu Leu Gln Leu Phe Asp Ile Pro Gln Ile Ala Tyr Ser Ala
            175                 180                 185

ACA AGC ATA GAC CTG AGT GAC AAA ACT TTG TAC AAA TAC TTC CTG AGG   985
Thr Ser Ile Asp Leu Ser Asp Lys Thr Leu Tyr Lys Tyr Phe Leu Arg
        190                 195                 200

GTG GTC CCT TCT GAC ACT TTG CAG GCA AGG GCG ATG CTC GAC ATA GTC   1033
Val Val Pro Ser Asp Thr Leu Gln Ala Arg Ala Met Leu Asp Ile Val
    205                 210                 215

AAG CGT TAC AAC TGG ACC TAT GTC TCA GCA GTC CAC ACA GAA GGG AAT   1081
Lys Arg Tyr Asn Trp Thr Tyr Val Ser Ala Val His Thr Glu Gly Asn
220                 225                 230                 235

TAC GGC GAG AGT GGA ATG GAT GCT TTG AAA GAA CTG GCT GCC CAG GAA   1129
Tyr Gly Glu Ser Gly Met Asp Ala Phe Lys Glu Leu Ala Ala Gln Glu
                240                 245                 250

GGC CTC TGC ATC GCA CAC TCG GAC AAA ATC TAC AGC AAT GCT GGC GAG   1177
Gly Leu Cys Ile Ala His Ser Asp Lys Ile Tyr Ser Asn Ala Gly Glu
            255                 260                 265

AAG AGC TTT GAC CGG CTC CTG CGT AAA CTC CGG GAG CGG CTT CCC AAG   1225
Lys Ser Phe Asp Arg Leu Leu Arg Lys Leu Arg Glu Arg Leu Pro Lys
        270                 275                 280

GCC AGG GTT GTG GTC TGC TTC TGC GAG GGC ATG ACA GTG CGG GGC TTA   1273
Ala Arg Val Val Val Cys Phe Cys Glu Gly Met Thr Val Arg Gly Leu
    285                 290                 295

FIG. 5B
CTG AGT GCC ATG CGC CGC CTG GGC GTC GTG GGC GAG TTC TCA CTC ATT   1321
Leu Ser Ala Met Arg Arg Leu Gly Val Val Gly Glu Phe Ser Leu Ile
300                 305                 310                 315

GGA AGT GAT GGA TGG GCA GAC AGA GAT GAA GTC ATC GAA GGG TAT GAG   1369
Gly Ser Asp Gly Trp Ala Asp Arg Asp Glu Val Ile Glu Gly Tyr Glu
                320                 325                 330

GTG GAA GCC AAC GGA GGG ATC ACA ATA AAG CTT CAG TCT CCA GAG GTC   1417
Val Glu Ala Asn Gly Gly Ile Thr Ile Lys Leu Gln Ser Pro Glu Val
            335                 340                 345

AGG TCA TTT GAT GAC TAC TTC CTG AAG CTG AGG CTG GAC ACC AAC ACA   1465
Arg Ser Phe Asp Asp Tyr Phe Leu Lys Leu Arg Leu Asp Thr Asn Thr
        350                 355                 360

AGG AAT CCT TGG TTC CCT GAG TTC TGG CAA CAT CGC TTC CAG TGT CGC   1513
Arg Asn Pro Trp Phe Pro Glu Phe Trp Gln His Arg Phe Gln Cys Arg
    365                 370                 375

CTA CCT GGA CAC CTC TTG GAA AAC CCC AAC TTT AAG AAA GTG TGC ACA   1561
Leu Pro Gly His Leu Leu Glu Asn Pro Asn Phe Lys Lys Val Cys Thr
380                 385                 390                 395

GGA AAT GAA AGC TTG GAA GAA AAC TAT GTC CAG GAC AGC AAA ATG GGA   1609
Gly Asn Glu Ser Leu Glu Glu Asn Tyr Val Gln Asp Ser Lys Met Gly
                400                 405                 410

TTT GTC ATC AAT GCC ATC TAT GCC ATG GCA CAT GGG CTG CAG AAC ATG   1657
Phe Val Ile Asn Ala Ile Tyr Ala Met Ala His Gly Leu Gln Asn Met
            415                 420                 425

CAC CAT GCT CTG TGT CCC GGC CAT GTG GGC CTG TGT GAT GCT ATG AAA   1705
His His Ala Leu Cys Pro Gly His Val Gly Leu Cys Asp Ala Met Lys
        430                 435                 440

CCC ATT GAT GGC AGG AAG CTC CTG GAT TTC CTC ATC AAA TCC TCT TTT   1753
Pro Ile Asp Gly Arg Lys Leu Leu Asp Phe Leu Ile Lys Ser Ser Phe
    445                 450                 455

GTC GGA GTG TCT GGA GAG GAG GTG TGG TTC GAT GAG AAG GGG GAT GCT   1801
Val Gly Val Ser Gly Glu Glu Val Trp Phe Asp Glu Lys Gly Asp Ala
460                 465                 470                 475

FIG. 5C
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10/32 1849 
CCC GGA AGG TAT GAC ATT ATG AAT CTG CAG TAC ACA GAA GGT AAT CGC 
Pro. Gly Arg Tyr Asp lie Met Asn Leu Gin Tyr Thr Glu Ala Asn Arg 

.480 485 490 

1897 

TAT GAC TAT GTC CAC GTG GGG ACC TGG CAT GAA GGA GTG CTG AAT ATT 
Tyr Asp Tyr Val His Val Gly Thr Trp His Glu Gly Val Leu Asn He 
495 : 500 505 

< • 1945 

GAT GAT TAC AAA ATC CAG ATG AAC AAA AGC GGA ATG GTA CGA TCT GTG 
Asp Asp Tyr Lys lie Gin Met Asn Lys Ser Gly Met Val Arg Ser Val 
510 515 520 

v.. 1993 
TGC AGT GAG CCT TGC TTA AAG GGT CAG ATT AAG GTC ATA CGG AAA GGA 
Cys Ser Glu Pro Cys Leu Lys Gly Gin He Lys Val He Arg Lys Gly 
525 530 535 

2041 

GAA GTG AGC TGC TGC TGG ATC TGC ACG GCC TGC AAA GAG AAT GAG TTT 
Glu Val Ser Cys Cys Trp He Cys Thr Ala Cys Lys Glu Asn Glu Phe 
540 545 550 555 

2089 

GTG CAG GAC GAG TTC ACC TGC AGA GCC TGT GAC CTG GGG TGG TGG CCC 
Val Gin Asp Glu Phe Thr Cys Arg Ala Cys Asp Leu Gly Trp Trp Pro 

560 555 570 

2137 

AAC GCA GAG: CTC ACA GGC TGT GAG CCC ATT CCT GTC CGT TAT CTT GAG 
Asn Ala Glu Leu Thr Gly Cys Glu Pro lie Pro Val" Arg Tyr Leu Glu 
575 . 580 585 



TGG AGT GAC ATA GAA TCT ATC ATA GCC ATC GCC TTT TCT TGC CTG GGC 

Trp Ser Asp lie Glu Ser He He Ala He Ala Phe Ser Cys Leu Gly 

590 595 600 

2233

ATC CTC GTG ACG CTG TTT GTC ACC CTC ATC TTC GTT CTG TAC CGG GAC

Ile Leu Val Thr Leu Phe Val Thr Leu Ile Phe Val Leu Tyr Arg Asp

605 610 615



2281 

ACA CCC GTG GTC AAA TCC TCC AGT AGG GAG CTC TGC TAT ATC ATT CTG 
Thr Pro Val Val Lys Ser Ser Ser Arg Glu Leu Cys Tyr Ile Ile Leu
620 625 630 635

2329

GCT GGT ATT TTC CTC GGC TAT GTG TGC CCT TTC ACC CTC ATC GCC AAA

Ala Gly Ile Phe Leu Gly Tyr Val Cys Pro Phe Thr Leu Ile Ala Lys

640 645 650
2377

CCT ACT ACC ACA TCC TGC TAC CTC CAG CGC CTC CTA GTT GGC CTC TCT 
Pro Thr Thr Thr Ser Cys Tyr Leu Gln Arg Leu Leu Val Gly Leu Ser
655 660 665 



2425 

TCT GCC ATG TGC TAC TCT GCT TTA GTG ACC AAA ACC AAT CGT ATT GCA 

Ser Ala Met Cys Tyr Ser Ala Leu Val Thr Lys Thr Asn Arg Ile Ala

670 675 680

2473

CGC ATC CTG GCT GGC AGC AAG AAG AAG ATC TGC ACC CGG AAG CCC AGA 

Arg Ile Leu Ala Gly Ser Lys Lys Lys Ile Cys Thr Arg Lys Pro Arg

685 690 695 



2521 

TTC ATG AGC GCT TGG GCC CAA GTG ATC ATA GCC TCC ATT CTG ATT AGT 
Phe Met Ser Ala Trp Ala Gln Val Ile Ile Ala Ser Ile Leu Ile Ser
700 705 710 715 



2569 

GTA CAG CTA ACA CTA GTG GTG ACC TTG ATC ATC ATG GAG CCT CCC ATG 
Val Gln Leu Thr Leu Val Val Thr Leu Ile Ile Met Glu Pro Pro Met

720 725 730

2617 

CCC ATT TTG TCC TAC CCG AGT ATC AAG GAA GTC TAC CTT ATC TGC AAT 
Pro Ile Leu Ser Tyr Pro Ser Ile Lys Glu Val Tyr Leu Ile Cys Asn
735 740 745 



2665 

ACC AGC AAC CTG GGT GTA GTG GCC CCT GTG GGT TAC AAT GGA CTC CTC 
Thr Ser Asn Leu Gly Val Val Ala Pro Val Gly Tyr Asn Gly Leu Leu
750 755 760



2713 

ATC ATG AGC TGT ACC TAC TAT GCC TTC AAG ACC CGC AAC GTG CCG GCC 
Ile Met Ser Cys Thr Tyr Tyr Ala Phe Lys Thr Arg Asn Val Pro Ala
765 770 775 



2761

AAC TTC AAT GAG GCT AAA TAC ATC GCC TTC ACC ATG TAC ACT ACC TGC
Asn Phe Asn Glu Ala Lys Tyr Ile Ala Phe Thr Met Tyr Thr Thr Cys
780 785 790 795



2809 

ATC ATC TGG CTG GCT TTC GTT CCC ATT TAC TTT GGG AGC AAC TAC AAG 
Ile Ile Trp Leu Ala Phe Val Pro Ile Tyr Phe Gly Ser Asn Tyr Lys

800 805 810 



2857 

ATC ATC ACT ACC TGC TTC GCG GTG AGC CTC AGT GTG ACG GTG GCC CTG 
Ile Ile Thr Thr Cys Phe Ala Val Ser Leu Ser Val Thr Val Ala Leu
815 820 825 

FIG. 5E
2905

GGG TGC ATG TTT ACT CCG AAG ATG TAC ATC ATC ATT GCC AAA CCT GAG 
Gly Cys Met Phe Thr Pro Lys Met Tyr Ile Ile Ile Ala Lys Pro Glu
830 835 840 

2953 

AGG AAC GTC CGC AGT GCC TTC ACG ACC TCT GAT GTT GTC CGC ATG CAC
Arg Asn Val Arg Ser Ala Phe Thr Thr Ser Asp Val Val Arg Met His
845 850 855

3001 

GTC GGT GAT GGC AAA CTG CCG TGC CGC TCC AAC ACC TTC CTC AAC ATT 
Val Gly Asp Gly Lys Leu Pro Cys Arg Ser Asn Thr Phe Leu Asn Ile
860 865 870 875 

3049 

TTC CGG AGA AAG AAG CCC GGG GCA GGG AAT GCC AAT TCT AAC GGC AAG 
Phe Arg Arg Lys Lys Pro Gly Ala Gly Asn Ala Asn Ser Asn Gly Lys 

880 885 890 

3097 

TCT GTG TCA TGG TCT GAA CCA GGT GGA AGA CAG GCG CCC AAG GGA CAG 
Ser Val Ser Trp Ser Glu Pro Gly Gly Arg Gln Ala Pro Lys Gly Gln
895 900 905

3145

CAC GTG TGG CAG CGC CTC TCT GTG CAC GTG AAG ACC AAC GAG ACG GCC 

His Val Trp Gln Arg Leu Ser Val His Val Lys Thr Asn Glu Thr Ala
910 915 920

3193
TGT AAC CAA ACA GCC GTA ATC AAA CCC CTC ACT AAA AGT TAC CAA GGC
Cys Asn Gln Thr Ala Val Ile Lys Pro Leu Thr Lys Ser Tyr Gln Gly
925 930 935 

3241 

TCT GGC AAG AGC CTG ACC TTT TCA GAT GCC AGC ACC AAG ACC CTT TAC 
Ser Gly Lys Ser Leu Thr Phe Ser Asp Ala Ser Thr Lys Thr Leu Tyr 
940 945 950 955 

3289 

AAT GTG GAA GAA GAG GAC AAT ACC CCT TCT GCT CAC TTC AGC CCT CCC 
Asn Val Glu Glu Glu Asp Asn Thr Pro Ser Ala His Phe Ser Pro Pro 

960 965 970 

3337

AGC AGC CCT TCT ATG GTG GTG CAC CGA CGC GGG CCA CCC GTG GCC ACC 
Ser Ser Pro Ser Met Val Val His Arg Arg Gly Pro Pro Val Ala Thr 
975 980 985 

3385 

ACA CCA CCT CTG CCA CCC CAT CTG ACC GCA GAA GAG ACC CCC CTG TTC 
Thr Pro Pro Leu Pro Pro His Leu Thr Ala Glu Glu Thr Pro Leu Phe 
990 995 1000 

FIG. 5F
CTG GCT GAT TCC GTC ATC CCC AAG GGC TTG CCT CCT CCT CTC CCG CAG

Leu Ala Asp Ser Val Ile Pro Lys Gly Leu Pro Pro Pro Leu Pro Gln
1005 1010 1015 

3481 

CAG CAG CCA CAG CAG CCG CCC CCT CAG CAG CCC CCG CAG CAG CCC AAG 

Gln Gln Pro Gln Gln Pro Pro Pro Gln Gln Pro Pro Gln Gln Pro Lys

1020 1025 1030 1035 

3529 

TCC CTG ATG GAC CAG CTG CAA GGC GTA GTC ACC AAC TTC GGT TCG GGG 
Ser Leu Met Asp Gln Leu Gln Gly Val Val Thr Asn Phe Gly Ser Gly

1040 1045 1050 

3577 

ATT CCA GAT TTC CAT GCG GTG CTG GCA GGC CCG GGG ACA CCA GGA AAC
Ile Pro Asp Phe His Ala Val Leu Ala Gly Pro Gly Thr Pro Gly Asn
1055 1060 1065 

3625 

AGC CTG CGC TCT CTG TAC CCG CCC CCG CCT CCG CCG CAA CAC CTG CAG 
Ser Leu Arg Ser Leu Tyr Pro Pro Pro Pro Pro Pro Gln His Leu Gln
1070 1075 1080

3673

ATG CTG CCC CTG CAC CTG AGC ACC TTC CAG GAG GAG TCC ATC TCC CCT
Met Leu Pro Leu His Leu Ser Thr Phe Gln Glu Glu Ser Ile Ser Pro
1085 1090 1095 

3721 

CCT GGG GAG GAC ATC GAT GAT GAC AGT GAG AGA TTC AAG CTC CTG CAG
Pro Gly Glu Asp Ile Asp Asp Asp Ser Glu Arg Phe Lys Leu Leu Gln
1100 1105 1110 1115

3769 

GAG TTC GTG TAC GAG CGC GAA GGG AAC ACC GAA GAA GAT GAA TTG GAA 
Glu Phe Val Tyr Glu Arg Glu Gly Asn Thr Glu Glu Asp Glu Leu Glu 

1120 1125 1130 

3817 

GAG GAG GAG GAC CTG CCC ACA GCC AGC AAG CTG ACC CCT GAG GAT TCT 
Glu Glu Glu Asp Leu Pro Thr Ala Ser Lys Leu Thr Pro Glu Asp Ser 
1135 1140 1145

3865 

CCT GCC CTG ACG CCT CCT TCT CCT TTC CGA GAT TCC GTG GCC TCT GGC 
Pro Ala Leu Thr Pro Pro Ser Pro Phe Arg Asp Ser Val Ala Ser Gly 
1150 1155 1160

3913

AGC TCA GTG CCC AGT TCC CCC GTA TCT GAG TCG GTC CTC TGC ACC CCT

Ser Ser Val Pro Ser Ser Pro Val Ser Glu Ser Val Leu Cys Thr Pro
1165 1170 1175

FIG. 5G
CCA AAT GTA ACC TAC GCC TCT GTC ATT CTG AGG GAC TAC AAG CAA AGC

Pro Asn Val Thr Tyr Ala Ser Val Ile Leu Arg Asp Tyr Lys Gln Ser
1180 1185 1190 1195

4013

TCT TCC ACC CTG TAGTGTGTGT GTGTGTGTGG GGGCGGGGGG AGTGCGCATG 
Ser Ser Thr Leu 



4073

GAGAAGCCAG AGATGCCAAG GAGTGTCAAC CCTTCCAGAA ATGTGTAGAA AGCAGGGTGA

4133 

GGGATGGGGA TGGAGGACCA CGGTCTGCAG GGAAGAAAAA AAAAATGCTG CGGCTGCCTT

4193

AAAGAAGGAG AGGGACGATG CCAACTGAAC AGTGGTCCTG GCCAGGATTG TGACTCTTGA

4253

ATTATTCAAA AACCTTCTCT AGAAAGAAAG GGAATTATGA CAAAGCACAA TTCCATATGG

4300
TATGTAACTT TTATCGAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAA



FIG. 5H
60

TAAGAATTTT ATAAATACTC TGGGAATTTT ATTGGTGATG CCTTTGTGTC TACAGGGCAC 

120
ACGTTCCAGA GAGCTCTGGT GTGAAGTGAT GGGGGACTTG TGGCTAGAGA AGCTTTTCAA

180
TGGCCTTAAA CTCTGGGTCC TGCTTGAGAG AGGTCTGAGG TTCTCAACAT CAGAGCAGAG 

240 

CTTCCACCAA GCTTTCAGAA TGCTAAGCCC CCACTTCTCA ACACTTAGTG CTCTGATCGG 

300 

TGCCTGCGAA CCGAGAACGG CTGCAGTCCT CTGACCTGAG ACCAATAGCT GTGTCTAGCC 

360 

GGACTCAGCG TCCAGGTCAC CGCCACTAAC GCGCCGCGCA TTGGACACCT GATCCACACA 

420 

CCTTCGGGCA CCAGTGAAAA ACCGCGACTT GATTTTGTGG AAGAACGCCC CCAGGGTGTG 

480 

GGAGCGGTCG TGGAGGACCA GCAGGAGGAA GCGGAGGGGA GAGGGGCAGT AGTGGAGGGA 

540

GAGAAAGCGT TGAACCAGCT GTGTTGGCCG AAGGCACGAA ACGGCAAAAG GCAGCGGTGA 

600 

GCATCTGTGT GGTTCCCGCT GGGAACCTGC AGGCAGGACC GGCGTGGGAA CGTGGCTGGC 

CCGCGGTGGA CCGGGTCTTC GCCACA ATG GTC CGG CTC CTC TTG ATT TTC TTC 

Met Val Arg Leu Leu Leu Ile Phe Phe

1 5

CCA ATG ATC TTT TTG GAG ATG TCC ATT TTG CCC AGG ATG CCT GAC AGA
Pro Met Ile Phe Leu Glu Met Ser Ile Leu Pro Arg Met Pro Asp Arg
10 15 20 25

749 

AAA GTA TTG CTG GCA GGT GCC TCG TCC CAG CGC TCC GTG GCG AGA ATG 
Lys Val Leu Leu Ala Gly Ala Ser Ser Gln Arg Ser Val Ala Arg Met
30 35 40

797

GAC GGA GAT GTC ATC ATC GGA GCC CTC TTC TCA GTC CAT CAC CAG CCT

Asp Gly Asp Val Ile Ile Gly Ala Leu Phe Ser Val His His Gln Pro

45 50 55

845

CCA GCC GAG AAG GTA CCC GAA AGG AAG TGT GGG GAG ATC AGG GAA CAG
Pro Ala Glu Lys Val Pro Glu Arg Lys Cys Gly Glu Ile Arg Glu Gln
60 65 70

893 

TAT GGT ATC CAG AGG GTG GAG GCC ATG TTC CAC ACG TTG GAT AAG ATT

FIG. 7A
Tyr Gly Ile Gln Arg Val Glu Ala Met Phe His Thr Leu Asp Lys Ile

75 80 85

941 

AAC GCG GAC CCG GTG CTC CTG CCC AAC ATC ACT CTG GGC AGT GAG ATC 

Asn Ala Asp Pro Val Leu Leu Pro Asn Ile Thr Leu Gly Ser Glu Ile
90 95 100 105 

989 

CGG GAC TCC TGC TGG CAC TCT TCA GTG GCT CTC GAA CAG AGC ATC GAA 
Arg Asp Ser Cys Trp His Ser Ser Val Ala Leu Glu Gln Ser Ile Glu

110 115 120 

1037 

TTC ATC AGA GAC TCC CTG ATT TCC ATC CGA GAT GAG AAG GAT GGG CTG 
Phe Ile Arg Asp Ser Leu Ile Ser Ile Arg Asp Glu Lys Asp Gly Leu
125 130 135

1085 

AAC CGA TGC CTG CCT GAT GGC CAG ACC CTG CCC CCT GGC AGG ACT AAG 
Asn Arg Cys Leu Pro Asp Gly Gln Thr Leu Pro Pro Gly Arg Thr Lys
140 145 150 

1133 

AAG CCT ATT GCT GGA GTG ATC GGC CCT GGC TCC AGC TCT GTG GCC ATT 
Lys Pro Ile Ala Gly Val Ile Gly Pro Gly Ser Ser Ser Val Ala Ile
155 160 165 

1181 

CAA GTC CAG AAT CTT CTC CAG CTG TTC GAC ATC CCA CAG ATC GCC TAT 
Gln Val Gln Asn Leu Leu Gln Leu Phe Asp Ile Pro Gln Ile Ala Tyr
170 175 180 185 

1229 

TCT GCC ACA AGC ATA GAC CTG AGT GAC AAA ACT TTG TAC AAA TAC TTC
Ser Ala Thr Ser Ile Asp Leu Ser Asp Lys Thr Leu Tyr Lys Tyr Phe

190 195 200 

1277 

CTG AGG GTG GTC CCT TCT GAC ACT TTG CAG GCA AGG GCG ATG CTC GAC 
Leu Arg Val Val Pro Ser Asp Thr Leu Gln Ala Arg Ala Met Leu Asp
205 210 215 

1325 

ATA GTC AAG CGT TAC AAC TGG ACC TAT GTC TCA GCA GTC CAC ACA GAA 
Ile Val Lys Arg Tyr Asn Trp Thr Tyr Val Ser Ala Val His Thr Glu
220 225 230 

1373 

GGG AAT TAC GGC GAG AGT GGA ATG GAT GCT TTC AAA GAA CTG GCT GCC 
Gly Asn Tyr Gly Glu Ser Gly Met Asp Ala Phe Lys Glu Leu Ala Ala 
235 240 245

1421
CAG GAA GGC CTC TGC ATC GCA CAC TCG GAC AAA ATC TAC AGC AAT GCT
Gln Glu Gly Leu Cys Ile Ala His Ser Asp Lys Ile Tyr Ser Asn Ala
250 255 260 265 

FIG. 7B
1469

GGC GAG AAG AGC TTT GAC CGG CTC CTG CGT AAA CTC CGG GAG CGG CTT 
Gly Glu Lys Ser Phe Asp Arg Leu Leu Arg Lys Leu Arg Glu Arg Leu 

270 275 280 

1517 

CCC AAG GCC AGG GTT GTG GTC TGC TTC TGC GAG GGC ATG ACA GTG CGG
Pro Lys Ala Arg Val Val Val Cys Phe Cys Glu Gly Met Thr Val Arg 
285 290 295 

1565 

GGC TTA CTG AGT GCC ATG CGC CGC CTG GGC GTC GTG GGC GAG TTC TCA 
Gly Leu Leu Ser Ala Met Arg Arg Leu Gly Val Val Gly Glu Phe Ser 
300 305 310 

1613 

CTC ATT GGA AGT GAT GGA TGG GCA GAC AGA GAT GAA GTC ATC GAA GGC 
Leu Ile Gly Ser Asp Gly Trp Ala Asp Arg Asp Glu Val Ile Glu Gly
315 320 325

1661

TAT GAG GTG GAA GCC AAC GGA GGG ATC ACA ATA AAG CTT CAG TCT CCA
Tyr Glu Val Glu Ala Asn Gly Gly Ile Thr Ile Lys Leu Gln Ser Pro
330 335 340 345

1709 

GAG GTC AGG TCA TTT GAT GAC TAC TTC CTG AAG CTG AGG CTG GAC ACC 
Glu Val Arg Ser Phe Asp Asp Tyr Phe Leu Lys Leu Arg Leu Asp Thr

350 355 360

1757
AAC ACA AGG AAT CCT TGG TTC CCT GAG TTC TGG CAA CAT CGC TTC CAG
Asn Thr Arg Asn Pro Trp Phe Pro Glu Phe Trp Gln His Arg Phe Gln
365 370 375 

1805 

TGT CGC CTA CCT GGA CAC CTC TTG GAA AAC CCC AAC TTT AAG AAA GTG 
Cys Arg Leu Pro Gly His Leu Leu Glu Asn Pro Asn Phe Lys Lys Val 
380 385 390 

1853

TGC ACA GGA AAT GAA AGC TTG GAA GAA AAC TAT GTC CAG GAC AGC AAA
Cys Thr Gly Asn Glu Ser Leu Glu Glu Asn Tyr Val Gln Asp Ser Lys
395 400 405

1901 

ATG GGA TTT GTC ATC AAT GCC ATC TAT GCC ATG GCA CAT GGG CTG CAG 
Met Gly Phe Val Ile Asn Ala Ile Tyr Ala Met Ala His Gly Leu Gln
410 415 420 425 

1949 

AAC ATG CAC CAT GCT CTG TGT CCC GGC CAT GTG GGC CTG TGT GAT GCT 
Asn Met His His Ala Leu Cys Pro Gly His Val Gly Leu Cys Asp Ala 

430 435 440 

FIG. 7C

1997
ATG AAA CCC ATT GAT GGC AGG AAG CTC CTG GAT TTC CTC ATC AAA TCC
Met Lys Pro Ile Asp Gly Arg Lys Leu Leu Asp Phe Leu Ile Lys Ser
445 450 455 

2045 

TCT TTT GTC GGA GTG TCT GGA GAG GAG GTG TGG TTC GAT GAG AAG GGG 
Ser Phe Val Gly Val Ser Gly Glu Glu Val Trp Phe Asp Glu Lys Gly 
460 465 470 

2093 

GAT GCT CCC GGA AGG TAT GAC ATT ATG AAT CTG CAG TAC ACA GAA GCT 
Asp Ala Pro Gly Arg Tyr Asp Ile Met Asn Leu Gln Tyr Thr Glu Ala
475 480 485 

2141 

AAT CGC TAT GAC TAT GTC CAC GTG GGG ACC TGG CAT GAA GGA GTG CTG 
Asn Arg Tyr Asp Tyr Val His Val Gly Thr Trp His Glu Gly Val Leu 
490 495 500 505 

2189 

AAT ATT GAT GAT TAC AAA ATC CAG ATG AAC AAA AGC GGA ATG GTA CGA 
Asn Ile Asp Asp Tyr Lys Ile Gln Met Asn Lys Ser Gly Met Val Arg

510 515 520 

2237 

TCT GTG TGC AGT GAG CCT TGC TTA AAG GGT CAG ATT AAG GTC ATA CGG 
Ser Val Cys Ser Glu Pro Cys Leu Lys Gly Gln Ile Lys Val Ile Arg
525 530 535 

2285 

AAA GGA GAA GTG AGC TGC TGC TGG ATC TGC ACG GCC TGC AAA GAG AAT 
Lys Gly Glu Val Ser Cys Cys Trp Ile Cys Thr Ala Cys Lys Glu Asn
540 545 550 

2333 

GAG TTT GTG CAG GAC GAG TTC ACC TGC AGA GCC TGT GAC CTG GGG TGG 
Glu Phe Val Gln Asp Glu Phe Thr Cys Arg Ala Cys Asp Leu Gly Trp
555 560 565 

2381 

TGG CCC AAC GCA GAG CTC ACA GGC TGT GAG CCC ATT CCT GTC CGT TAT 
Trp Pro Asn Ala Glu Leu Thr Gly Cys Glu Pro Ile Pro Val Arg Tyr
570 575 580 585 

2429 

CTT GAG TGG AGT GAC ATA GAA TCT ATC ATA GCC ATC GCC TTT TCT TGC 
Leu Glu Trp Ser Asp Ile Glu Ser Ile Ile Ala Ile Ala Phe Ser Cys

590 595 600 

2477 

CTG GGC ATC CTC GTG ACG CTG TTT GTC ACC CTC ATC TTC GTT CTG TAC 
Leu Gly Ile Leu Val Thr Leu Phe Val Thr Leu Ile Phe Val Leu Tyr
605 610 615 

2525 

CGG GAC ACA CCC GTG GTC AAA TCC TCC AGT AGG GAG CTC TGC TAT ATC 
Arg Asp Thr Pro Val Val Lys Ser Ser Ser Arg Glu Leu Cys Tyr Ile

FIG. 7D
2573

ATT CTG GCT GGT ATT TTC CTC GGC TAT GTG TGC CCT TTC ACC CTC ATC 
Ile Leu Ala Gly Ile Phe Leu Gly Tyr Val Cys Pro Phe Thr Leu Ile
635 640 645

GCC AAA CCT ACT ACC ACA TCC TGC TAC CTC CAG CGC CTC CTA GTT GGC
Ala Lys Pro Thr Thr Thr Ser Cys Tyr Leu Gln Arg Leu Leu Val Gly
650 655 660 665 

2669 

CTC TCT TCT GCC ATG TGC TAC TCT GCT TTA GTG ACC AAA ACC AAT CGT
Leu Ser Ser Ala Met Cys Tyr Ser Ala Leu Val Thr Lys Thr Asn Arg

670 675 680

2717

ATT GCA CGC ATC CTG GCT GGC AGC AAG AAG AAG ATC TGC ACC CGG AAG
Ile Ala Arg Ile Leu Ala Gly Ser Lys Lys Lys Ile Cys Thr Arg Lys
685 690 695 

2765 

CCC AGA TTC ATG AGC GCT TGG GCC CAA GTG ATC ATA GCC TCC ATT CTG 
Pro Arg Phe Met Ser Ala Trp Ala Gln Val Ile Ile Ala Ser Ile Leu
700 705 710 

2813 

ATT AGT GTA CAG CTA ACA CTA GTG GTG ACC TTG ATC ATC ATG GAG CCT 
Ile Ser Val Gln Leu Thr Leu Val Val Thr Leu Ile Ile Met Glu Pro
715 720 725

2861

CCC ATG CCC ATT TTG TCC TAC CCG AGT ATC AAG GAA GTC TAC CTT ATC

Pro Met Pro Ile Leu Ser Tyr Pro Ser Ile Lys Glu Val Tyr Leu Ile

730 735 740 745 

2909 

TGC AAT ACC AGC AAC CTG GGT GTA GTG GCC CCT GTG GGT TAC AAT GGA 
Cys Asn Thr Ser Asn Leu Gly Val Val Ala Pro Val Gly Tyr Asn Gly

750 755 760

2957

CTC CTC ATC ATG AGC TGT ACC TAC TAT GCC TTC AAG ACC CGC AAC GTG
Leu Leu Ile Met Ser Cys Thr Tyr Tyr Ala Phe Lys Thr Arg Asn Val
765 770 775 

3005 

CCG GCC AAC TTC AAT GAG GCT AAA TAC ATC GCC TTC ACC ATG TAC ACT 
Pro Ala Asn Phe Asn Glu Ala Lys Tyr Ile Ala Phe Thr Met Tyr Thr
780 785 790 

3053 

ACC TGC ATC ATC TGG CTG GCT TTC GTT CCC ATT TAC TTT GGG AGC AAC 
Thr Cys Ile Ile Trp Leu Ala Phe Val Pro Ile Tyr Phe Gly Ser Asn
795 800 805

FIG. 7E
3101

TAC AAG ATC ATC ACT ACC TGC TTC GCG GTG AGC CTC AGT GTG ACG GTG
Tyr Lys Ile Ile Thr Thr Cys Phe Ala Val Ser Leu Ser Val Thr Val
810 815 820 825

3149

GCC CTG GGG TGC ATG TTT ACT CCG AAG ATG TAC ATC ATC ATT GCC AAA
Ala Leu Gly Cys Met Phe Thr Pro Lys Met Tyr Ile Ile Ile Ala Lys

830 835 840

3197

CCT GAG AGG AAC GTC CGC AGT GCC TTC ACG ACC TCT GAT GTT GTC CGC
Pro Glu Arg Asn Val Arg Ser Ala Phe Thr Thr Ser Asp Val Val Arg

845 850 855

3245 

ATG CAC GTC GGT GAT GGC AAA CTG CCG TGC CGC TCC AAC ACC TTC CTC 
Met His Val Gly Asp Gly Lys Leu Pro Cys Arg Ser Asn Thr Phe Leu 
860 865 870 

3293 

AAC ATT TTC CGG AGA AAG AAG CCC GGG GCA GGG AAT GCC AAG AAG AGG 
Asn Ile Phe Arg Arg Lys Lys Pro Gly Ala Gly Asn Ala Lys Lys Arg
875 880 885

3341

CAG CCA GAA TTC TCG CCC AGC AGC CAG TGT CCG TCG GCA CAT GCG CAG
Gln Pro Glu Phe Ser Pro Ser Ser Gln Cys Pro Ser Ala His Ala Gln
890 895 900 905

3394

CTT TGAAAACCCC CACACTGCAG TGAATGTTTC TAACGGCAAG TCTGTGTCAT 

3454

GGTCTGAACC AGGTGGAAGA CAGGCGCCCA AGGGACAGCA CGTGTGGCAG CGCCTCTCTG

3514

TGCACGTGAA GACCAACGAG ACGGCCTGTA ACCAAACAGC CGTAATCAAA CCCCTCACTA

3574
AAAGTTACCA AGGCTCTGGC AAGAGCCTGA CCTTTTCAGA TGCCAGCACC AAGACCCTTT

3634

ACAATGTGGA AGAAGAGGAC AATACCCCTT CTGCTCACTT CAGCCCTCCC AGCAGCCCTT

3694
CTATGGTGGT GCACCGACGC GGGCCACCCG TGGCCACCAC ACCACCTCTG CCACCCCATC

3754

TGACCGCAGA AGAGACCCCC CTGTTCCTGG CTGATTCCGT CATCCCCAAG GGCTTGCCTC

3814

CTCCTCTCCC GCAGCAGCAG CCACAGCAGC CGCCCCCTCA GCAGCCCCCG CAGCAGCCCA

3874

AGTCCCTGAT GGACCAGCTG CAAGGCGTAG TCACCAACTT CGGTTCGGGG ATTCCAGATT

FIG. 7F
3934

TCCATGCGGT GCTGGCAGGC CCGGGGACAC CAGGAAACAG CCTGCGCTCT CTGTACCCGC

3994

CCCCGCCTCC GCCGCAACAC CTGCAGATGC TGCCCCTGCA CCTGAGCACC TTCCAGGAGG

4054

AGTCGATCTC CCCTCCTGGG GAGGACATCG ATGATGACAG TGAGAGATTC AAGCTCCTGC

4114

AGGAGTTCGT GTACGAGCGC GAAGGGAACA CCGAAGAAGA TGAATTGGAA GAGGAGGAGG

4174

ACCTGCGCAC AGCCAGCAAG GTGACCCCTG AGGATTCTCC TGCCCTGACG CCTCCTTCTC

4234

CTTTCCGAGA TTGCGTGGCC TCTGGCAGCT CAGTGCCCAG TTCCCCCGTA TCTGAGTCGG

4294

TCCTCTGCAC CCCTCCAAAT GTAACCTACG CCTCTGTCAT TCTGAGGGAC TACAAGCAAA

4354

GCTCTTCCAC CCTGTAGTGT GTGTGTGTGT GTGGGGGCGG GGGGAGTGCG CATGGAGAAG

4414

CCAGAGATGC CAAGGAGTGT CAACCCTTCC AGAAATGTGT AGAAAGCAGG GTGAGGGATG

4474

GGGATGGAGG ACCACGGTCT GCAGGGAAGA AAAAAAAAAA TGCTGCGGCT GCCTTAAAGA

4534

AGGAGAGGGA CGATGCCAAC TGAACAGTGG TCCTGGCCAG GATTGTGACT CTTGAATTAT

4594

TCAAAAACCT TCTCTAGAAA GAAAGGGAAT TATGACAAAG CACAATTCCA TATGGTATGT

4654

AACTTTTATC GAAAAAAATA ATAAAACGTA AAAATAAAAT CAACAAAAAT AATCTCTTCT

4714

TTTGCTCAAT CGTGCATACA TATATCTGCC CACACTCCCG TGGTAAAACT AGAAGCGAAG

4774

CAGGCCCTGC GATGGTGCCA ACTGAATCCT AAGTTCATCA TCCTAGTGAG CAGATGGAGA

4834

GAGGGCAGGA GGCGAGAGGG CAGGAGGCGG GGGTAGGTTC GGACAACAGC TCCCATCTCA

4894

GACCTTGACT GTGCTGAGTC TTCAGACTCC TGGACTAAGG AAGACCCGGG GACTGACCTT

4954

ATGAGGGTCC CTTTCCACTG CTGTGATGCA TTGCCAGCCT GTAGTCACCC GGGATAAAGG

5014

CACAGTAACC TTTTGCATTC CTGTGATTCC GTGTGTTTAA GGAAAAGGAA AGTATGAGCA

5074

AAGCTATCAC CAAAAAGAGC GCCATTAGAA GTTACGGGGG AGAAAAAAAG AGAAGCAAGA

5134

TGATATATAA GCACAGGGCC TTGAACAAGG TGAGCGTGCT TCACAGATTC CGTATTAATG

5194

TACAGATACT TTTGGAGAGG AGAAAGATAA CAAGGAGTGT CAGGCCGTTT GTGAACTCAC

5236

TTGCACTGTG CCAACCAGGT TGTCCGCTGC CCTTCAGCAA AA

FIG. 7H
60

CCCGGGCTCC CGGCAGTGCG AGCAGCTAAG GGCTGGCCGC CGCCTCCCTG AGCTCCCCCG 

120 

GMGCAGCCGA CCCCTGGTCG CGGCGTTCAC CTCGCCGATG CGCGGTTGGT AGGAGTGACC 

180 

GGAGCCATTC TCTCCTCGTT GATAAGATTC CCTACCAGGA TAGGAGCCTA TCTCCCTTTY 

240

CACAGCAGGA CACAGAAATC TGGCCTTCAG TACTTTGGGA AAAGGATCTG AGACCTCCTG 

300 

GAGCTCTGAC CACTGGCTGT CATCTGTGGG TCTGGCCTGT GTGGGCCACT GAGCTCTACT 

360 

CAAACATTAA AGAGGAGGAG GGGAGATCTG TGGAATGGGC CACCCCGTTG GCCTGCTGCA 

420 

TTACTGAACC TGCGCTGTCC ACACGTGCCC AGATCATGGG ACCCAGGGCC TGCTAGGGCT 

475 

AGGAGCGGGG CCCAGTATTC ATGGGTCTCT AGGCCTTTCC GAA ATG TCC GGG AAG 

Met Ser Gly Lys 

523 

GGA GGC TGG GCC TGG TGG TGG GCC CGG CTG CCC CTC TGC CTA CTC CTC 
Gly Gly Trp Ala Trp Trp Trp Ala Arg Leu Pro Leu Cys Leu Leu Leu 
5 10 15 20

571 

AGC CTT TAT GCC CCC TGG GTG CCT TCA TCC TTG GGA AAG CCC AAG GGT 
Ser Leu Tyr Ala Pro Trp Val Pro Ser Ser Leu Gly Lys Pro Lys Gly 

25 30 35

619

CAC CCC CAC ATG AAC TCT ATC CGA ATT GAC GGG GAC ATC ACA CTG GGA
His Pro His Met Asn Ser Ile Arg Ile Asp Gly Asp Ile Thr Leu Gly

40 45 50 

667

GGC CTG TTT CCC GTC CAC GGC CGT GGC TCT GAG GGT AAG GCC TGC GGG

Gly Leu Phe Pro Val His Gly Arg Gly Ser Glu Gly Lys Ala Cys Gly
55 60 65

715

GAG CTG AAG AAG GAG AAA GGC ATC CAC CGC CTG GAG GCC ATG CTG TTT
Glu Leu Lys Lys Glu Lys Gly Ile His Arg Leu Glu Ala Met Leu Phe
70 75 80 

763 

GCC CTG GAC CGC ATC AAC AAT GAC CCG GAC CTA CTG CCC AAC ATC ACG 
Ala Leu Asp Arg Ile Asn Asn Asp Pro Asp Leu Leu Pro Asn Ile Thr
85 90 95 100

FIG. 8A
811

TTG GGC GCC CGC ATT CTG GAC ACC TGC TCG AGG GAC ACC CAC GCC CTG
Leu Gly Ala Arg Ile Leu Asp Thr Cys Ser Arg Asp Thr His Ala Leu

105 110 115 

859 

GAG CAG TCA CTG ACC TTT GTG CGG GCG CTC ATC GAG AAG GAC GGC ACG
Glu Gln Ser Leu Thr Phe Val Arg Ala Leu Ile Glu Lys Asp Gly Thr
120 125 130 

907 

GAG GTC CGC TGC GGC AGG CGG GGC CCG CCC ATC ATC ACC AAG CCC GAA 
Glu Val Arg Cys Gly Arg Arg Gly Pro Pro Ile Ile Thr Lys Pro Glu
135 140 145 

955 

CGA GTG GTG GGT GTC ATT GGA GCT TCG GGG AGC TCC GTC TCG ATC ATG 
Arg Val Val Gly Val Ile Gly Ala Ser Gly Ser Ser Val Ser Ile Met
150 155 160 

1003 

GTG GCC AAC ATC CTC CGC CTC TTC AAG ATC CCT CAG ATC AGC TAT GCC
Val Ala Asn Ile Leu Arg Leu Phe Lys Ile Pro Gln Ile Ser Tyr Ala
165 170 175 180 

1051 

TCC ACG GCC CCT GAC TTG AGT GAC AAC AGC CGC TAT GAC TTC TTC TCC 
Ser Thr Ala Pro Asp Leu Ser Asp Asn Ser Arg Tyr Asp Phe Phe Ser 

185 190 195 

1099 

CGG GTG GTG CCC TCA GAC ACA TAC CAG GCC CAG GCC ATG GTG GAT ATT 
Arg Val Val Pro Ser Asp Thr Tyr Gln Ala Gln Ala Met Val Asp Ile
200 205 210 

1147 

GTC CGA GCC CTC AAG TGG AAC TAT GTG TCC ACA CTG GCC TCA GAG GGC 
Val Arg Ala Leu Lys Trp Asn Tyr Val Ser Thr Leu Ala Ser Glu Gly 
215 220 225 

1195 

AGC TAC GGT GAG AGT GGT GTG GAG GCC TTT ATC CAG AAG TCC CGA GAG 
Ser Tyr Gly Glu Ser Gly Val Glu Ala Phe Ile Gln Lys Ser Arg Glu
230 235 240 

1243 

AAC GGA GGT GTG TGC ATT GCC CAG TCG GTG AAG ATT CCA CGG GAA CCC 
Asn Gly Gly Val Cys Ile Ala Gln Ser Val Lys Ile Pro Arg Glu Pro
245 250 255 260 

1291 

AAG ACG GGG GAG TTC GAC AAG ATC ATC AAA CGC CTA CTG GAA ACA TCC 
Lys Thr Gly Glu Phe Asp Lys Ile Ile Lys Arg Leu Leu Glu Thr Ser

265 270 275 

FIG. 8B

1339

AAT GCC AGG GGT ATC ATC ATC TTT GCC AAC GAG GAT GAC ATC AGG AGG
Asn Ala Arg Gly Ile Ile Ile Phe Ala Asn Glu Asp Asp Ile Arg Arg
280 285 290

1387

GTG TTG GAG GCA GCT CGC AGG GCC AAC CAG ACC GGC CAC TTC TTT TGG
Val Leu Glu Ala Ala Arg Arg Ala Asn Gln Thr Gly His Phe Phe Trp
295 300 305

1435

ATG GGT TCT GAT AGC TGG GGC TCC AAG AGT GCC CCT GTG CTG CGC CTT
Met Gly Ser Asp Ser Trp Gly Ser Lys Ser Ala Pro Val Leu Arg Leu
310 315 320

1483

GAG GAG GTG GCC GAG GGC GCA GTC ACC ATT CTC CCC AAG AGG ATG TCT
Glu Glu Val Ala Glu Gly Ala Val Thr Ile Leu Pro Lys Arg Met Ser
325 330 335 340

1531

GTT CGA GGG TTC GAC CGA TAC TTC TCC AGC CGC ACG CTG GAC AAC AAC
Val Arg Gly Phe Asp Arg Tyr Phe Ser Ser Arg Thr Leu Asp Asn Asn
345 350 355

1579

AGG CGC AAC ATC TGG TTT GCC GAG TTC TGG GAG GAC AAC TTC CAT TGC
Arg Arg Asn Ile Trp Phe Ala Glu Phe Trp Glu Asp Asn Phe His Cys
360 365 370

1627

AAG TTG AGC CGC CAC GCG CTC AAG AAG GGA AGC CAC ATC AAG AAG TGC
Lys Leu Ser Arg His Ala Leu Lys Lys Gly Ser His Ile Lys Lys Cys
375 380 385

1675

ACC AAC CGA GAG CGC ATC GGG CAG GAC TCG GCC TAT GAG CAG GAG GGG
Thr Asn Arg Glu Arg Ile Gly Gln Asp Ser Ala Tyr Glu Gln Glu Gly
390 395 400

1723

AAG GTG CAG TTC GTG ATT GAC GCT GTG TAC GCC ATG GGC CAC GCG CTG
Lys Val Gln Phe Val Ile Asp Ala Val Tyr Ala Met Gly His Ala Leu
405 410 415 420

1771

CAC GCC ATG CAC CGT GAC CTG TGT CCC GGC CGC GTA GGA CTC TGC CCT
His Ala Met His Arg Asp Leu Cys Pro Gly Arg Val Gly Leu Cys Pro
425 430 435

1819

CGC ATG GAC CCC GTG GAT GGC ACC CAG CTG CTT AAG TAC ATC AGG AAC
Arg Met Asp Pro Val Asp Gly Thr Gln Leu Leu Lys Tyr Ile Arg Asn
440 445 450



FIG. 8C
1867

GTC AAC TTC TCA GGC ATT GCG GGG AAC CCT GTA ACC TTC AAT GAG AAC
Val Asn Phe Ser Gly Ile Ala Gly Asn Pro Val Thr Phe Asn Glu Asn
455 460 465 

1915

GGA GAC GCA CCG GGG CGC TAC GAC ATC TAC CAG TAC CAA CTG CGC AAT

Gly Asp Ala Pro Gly Arg Tyr Asp Ile Tyr Gln Tyr Gln Leu Arg Asn
470 475 480 

1963 

GGC TCG GCC GAG TAC AAG GTC ATC GGC TCG TGG ACA GAC CAC CTG CAC 

Gly Ser Ala Glu Tyr Lys Val Ile Gly Ser Trp Thr Asp His Leu His

485 490 495 500

2011

CTC AGA ATA GAG CGG ATG CAG TGG CCA GGG AGT GGC CAG CAG CTG CCG
Leu Arg Ile Glu Arg Met Gln Trp Pro Gly Ser Gly Gln Gln Leu Pro

505 510 515

2059

CGC TCC ATC TGC AGT CTG CCC TGC CAG CCC GGG GAG CGA AAG AAG ACT
Arg Ser Ile Cys Ser Leu Pro Cys Gln Pro Gly Glu Arg Lys Lys Thr
520 525 530

2107 

GTG AAG GGC ATG GCT TGC TGC TGG CAC TGC GAG CCC TGC ACC GGG TAC 
Val Lys Gly Met Ala Cys Cys Trp His Cys Glu Pro Cys Thr Gly Tyr 
535 540 545 

2155 

CAG TAC CAA GTG GAC CGC TAC ACC TGT AAG ACC TGC CCC TAC GAC ATG 
Gln Tyr Gln Val Asp Arg Tyr Thr Cys Lys Thr Cys Pro Tyr Asp Met

550 555 560

2203

CGG CCC ACA GAG AAC CGC ACG AGC TGC CAG CCC ATC CCC ATC GTC AAG
Arg Pro Thr Glu Asn Arg Thr Ser Cys Gln Pro Ile Pro Ile Val Lys
565 570 575 580

2251

TTG GAG TGG GAC TCG CCG TGG GCC GTG CTG CCC CTC TTC CTG GCC GTG
Leu Glu Trp Asp Ser Pro Trp Ala Val Leu Pro Leu Phe Leu Ala Val

585 590 595

2299

GTG GGC ATC GCC GCC ACG CTG TTC GTG GTG GTC ACG TTT GTG CGC TAC

Val Gly Ile Ala Ala Thr Leu Phe Val Val Val Thr Phe Val Arg Tyr

600 605 610 

2347 

AAC GAT ACC CCC ATC GTC AAG GCC TCG GGC CGG GAG CTG AGC TAC GTG 

Asn Asp Thr Pro Ile Val Lys Ala Ser Gly Arg Glu Leu Ser Tyr Val
615 620 625 

FIG. 8D



SUBSTITUTE SHEET 



WO 92/10583 PCT/US91/09422 

2395

CTG CTG GCG GGC ATC TTT CTG TGC TAC GCC ACT ACC TTC CTC ATG ATC

Leu Leu Ala Gly Ile Phe Leu Cys Tyr Ala Thr Thr Phe Leu Met Ile
630 635 640 

2443 

GCA GAG CCG GAC CTG GGG ACC TGT TCG CTC CGC CGC ATC TTC CTA GGG 

Ala Glu Pro Asp Leu Gly Thr Cys Ser Leu Arg Arg Ile Phe Leu Gly

645 650 655 660

2491

CTC GGC ATG AGC ATC AGC TAC GCG GCC CTG CTG ACC AAG ACC AAC CGC
Leu Gly Met Ser Ile Ser Tyr Ala Ala Leu Leu Thr Lys Thr Asn Arg

665 670 675 

2539 

ATT TAC CGC ATC TTT GAG CAG GGC AAA CGG TCG GTC AGT GCC CCG CGT 
Ile Tyr Arg Ile Phe Glu Gln Gly Lys Arg Ser Val Ser Ala Pro Arg
680 685 690

2587
TTC ATC AGC CCG GCC TCG CAG CTG GCC ATC ACC TTC ATC CTC ATC TCC
Phe Ile Ser Pro Ala Ser Gln Leu Ala Ile Thr Phe Ile Leu Ile Ser
695 700 705

2635 

CTG CAG CTG CTC GGC ATC TGC GTG TGG TTC GTG GTG GAC CCC TCC CAC 
Leu Gln Leu Leu Gly Ile Cys Val Trp Phe Val Val Asp Pro Ser His
710 715 720 

2683 

TCG GTG GTG GAC TTC CAG GAC CAA CGG ACA CTT GAC CCC CGC TTT GCC 
Ser Val Val Asp Phe Gln Asp Gln Arg Thr Leu Asp Pro Arg Phe Ala
725 730 735 740

2731

AGG GGC GTG CTC AAG TGC GAC ATC TCG GAC CTG TCC CTC ATC TGC CTG
Arg Gly Val Leu Lys Cys Asp Ile Ser Asp Leu Ser Leu Ile Cys Leu

745 750 755 

2779 

CTG GGC TAC AGC ATG CTG CTG ATG GTC ACG TGT ACT GTG TAC GCC ATC 
Leu Gly Tyr Ser Met Leu Leu Met Val Thr Cys Thr Val Tyr Ala Ile
760 765 770 

2827 

AAG ACC CGA GGC GTG CCC GAG ACC TTC AAC GAG GCC AAG CCC ATC GGC 
Lys Thr Arg Gly Val Pro Glu Thr Phe Asn Glu Ala Lys Pro Ile Gly
775 780 785 

2875 

TTC ACC ATG TAC ACC ACC TGC ATT GTC TGG CTG GCC TTC ATC CCC ATC 
Phe Thr Met Tyr Thr Thr Cys Ile Val Trp Leu Ala Phe Ile Pro Ile
790 795 800 

FIG. 8E

2923

TTT TTT GGC ACC TCA CAG TCA GCC GAC AAG CTG TAC ATC CAG ACA ACC
Phe Phe Gly Thr Ser Gln Ser Ala Asp Lys Leu Tyr Ile Gln Thr Thr
805 810 815 820

2971

ACA CTG ACG GTC TCC GTG AGT CTG AGC GCT TCA GTG TCC CTG GGG ATG
Thr Leu Thr Val Ser Val Ser Leu Ser Ala Ser Val Ser Leu Gly Met
825 830 835

3019

CTC TAC ATG CCC AAA GTC TAC ATC ATC CTC TTC CAC CCG GAG CAG AAC
Leu Tyr Met Pro Lys Val Tyr Ile Ile Leu Phe His Pro Glu Gln Asn
840 845 850

3067

GTG CCC AAG CGC AAG CGC AGT CTC AAA GCC GTG GTC ACC GCC GCC ACC
Val Pro Lys Arg Lys Arg Ser Leu Lys Ala Val Val Thr Ala Ala Thr
855 860 865

3115

ATG TCC AAC AAG TTC ACA CAG AAG GGC AAC TTC AGG CCC AAT GGG GAA
Met Ser Asn Lys Phe Thr Gln Lys Gly Asn Phe Arg Pro Asn Gly Glu
870 875 880

3163

GCC AAA TCA GAG CTG TGT GAG AAC CTG GAG ACC CCA GCG CTG GCT ACC
Ala Lys Ser Glu Leu Cys Glu Asn Leu Glu Thr Pro Ala Leu Ala Thr
885 890 895 900

3209

AAA CAG ACC TAC GTC ACC TAC ACC AAC CAT GCC ATC TAGCCGGGCC
Lys Gln Thr Tyr Val Thr Tyr Thr Asn His Ala Ile
905 910

3269

GCGGAGCCAA GCAGGCTAAG GAGCCACAAC CTCTGAGGAT GGCACATTGG GCCAGGGCCG

3329

TTCCCGAGGG CCCTGCCGAT GTCTGCCCGC CTCCCGGGCA TCCACGAATG TGGCTTGGTG

3389

CTGAGGACAG TAGAGACCCC GGCCATCACT GCTGGGCAAG CCGTGGTGGG CAACCAGAGG

3449

AGGCCGAGTG GCTGGGGCAG TTCCAGGTTA TGCCACACAC AGGTCTTCCT TCTGGACCAC

3509

TGTTGGCCCA GCCCCAAAGC ACAGGGGCTC GGTCTCCAGA GCCCAGCCCT GGCTTCCTCT

3569

CCTTCCTCCT GCCTCCGTCT GTCCTGTGGG TGACCCCGGT TGGTCCCTGC CCCGTCTTTA

3629

CGTTTCTCTT CCGTCTTTGC TCTGCATGTG TTGTCTGTTT GGGCCCTCTG CTTCCATATT

FIG. 8F

3689

TTTCCATTCT GCTCCTGGCC TTGCCCTGCC ATCTGCCCTG CCCCCTGCCC CTCCTCCCTG

3749

AGCTGCCCCA TCCCCGCCAT CATTTTCTCT TCTGTTCCCC CTCGATCTCA TTTCCTACCA

3809

GCCTTCCCCC TACTTGGCTT CATCCACCAA CTCTTTGACC ACGTTGCAAA AGAGAAAAAA

3869

AAAGGGGGGG GGGAATCACC CCCTACAAAA AAGCCCAAAC AAAAACTAAT CTTGAGTGTG

3929

TTTCGAAGTG CTGCGTCCTC CTGGTGGCCT GTGTGTCCCT GTGCCTGCAG CCTGTCTGCC

3989

CGCCCTACCC GTCTGCCGTG TGTCCTGCCC CCCCCGCCTG CCCGCCTTGC CCTTCCTGCT

4049

AACGACACGG AGTTCAGTGC CTGGGTGTTT GGTGATGGTC TCTGATGTGT AGCATGTCTG

4096

TTTTTATACC GAGAACATTT CTAATAAAGA TAAACACATG GTTTTGC

FIG. 8G



60

CCCAACATCA CGTTGGGCGC CCGCATTCTG GACACCTGCT CGAGGGACAC CCACGCCCTG

120 

GAGCAGTCAC TGACCTTTGT GCGGGCGCTC ATCGAGAAGG ACGGCACGGA GGTCCGCTGC

180 

GGCAGGCGGG GCCCGCCCAT CATCACCAAG CCCGAACGAG TGGTGGGTGT CATTGGAGCT 

240 

TCGGGGAGCT CCGTCTCGAT CATGGTGGCC AACATGCTCC GCCTCTTCAA GATCCCTCAG 

300 

ATCAGCTATG CCTCCACGGC CCCTGACTTG AGTGACAACA GCCGCTATGA CTTCTTCTCC

360 

CGGGTGGTGC CCTCAGACAG ATACCAGGCC CAGGCCATGG TGGATATTGT CCGAGCCCTC 

420 

AAGTGGAACT ATGTGTCCAC ACTGGCCTCA GAGGGCAGCT ACGGTGAGAG TGGTGTGGAG 

480 

GCCTTTATCC AGAAGTCCCG AGAGAACGGA GGTGTGTGCA TTGCCCAGTC GGTGAAGATT 

540

CCACGGGAAC CCAAGACGGG GGAGTTCGAC AAGATCATCA AACGCCTACT GGAAACATCC 

600 

AATGCCAGGG GTATGATCAT CTTTGCCAAC GAGGATGACA TCAGGAGGGT GTTGGAGGCA

660 

GCTGGCAGGG CCAACCAGAC CGGCCACTTC TTTTGGATGG GTTCTGATAG CTGGGGCTCC 

720 

AAGAGTGCCC CTGTGCTGCG CCTTGAGGAG GTGGCCGAGG GCGCAGTCAC CATTCTCCCC 

780
AAGAGGATGT CTGTTCGAGG GTTCGACCGA TACTTCTCCA GCCGCACGCT GGACAACAAC 

840 

AGGGGCAACA TCTGGTTTGC CGAGTTCTGG GAGGACAACT TCCATTGCAA GTTGAGCCGC

900
CACGCGCTCA AGAAGGGAAG CCACATCAAG AAGTGCACCA ACCGAGAGCG CATCGGGCAG

960 

GACTCGGCCT ATGAGCAGGA GGGGAAGGTG GAGTTCGTGA TTGACGGTGT GTACGCCATG 

1020
GGCCACGCGC TGCACGCCAT GCACCGTGAC CTGTGTCCCG GCCGCGTAGG ACTCTGCCCT 

1080 

CGCATGGACC CCGTGGATGG CACCCAGCTG CTTAAGTACA TCAGGAACGT CAACTTCTCA 

1140 

GGCATTGCGG GGAACCCTGT AACCTTCAAT GAGAACGGAG ACGCACCGGG GCGCTACGAC 

FIG. 9A

1200
ATCTACCAGT ACCAACTGCG CAATGGCTCG GCCGAGTACA AGGTCATCGG CTCGTGGACA

1260

GACCACCTGC ACCTCAGAAT AGAGCGGATG CAGTGGCCAG GGAGTGGCCA GCAGCTGCCG 

1320 

CGCTCCATCT GCAGTCTGCC CTGCCAGCCC GGGGAGCGAA AGAAGACTGT GAAGGGCATG 

1380 

GCTTGCTGCT GGCACTGCGA GCCCTGCACC GGGTACCAGT ACCAAGTGGA CCGCTACACC 

1440 

TGTAAGACCT GCCCCTACGA CATGCGGCCC ACAGAGAACC GCACGAGCTG CCAGCCCATC 

1500 

CCCATCGTCA AGTTGGAGTG GGACTCGCCG TGGGCCGTGC TGCCCCTCTT CCTGGCCGTG

1560 

GTGGGCATCG CCGCCACGCT GTTCGTGGTG GTCAGGTTTG TGCGCTACAA CGATACCCGC 

1620 

ATCGTCAAGG CCTCGGGCCG GGAGCTGAGC TACGTGCTGC TGGCGGGCAT CTTTCTGTGC 

1680 

TAGGCCACTA CCTTCCTCAT GATCGCAGAG CCGGACCTGG GGACCTGTTC GCTCCGCCGC 

1740 

ATCTTCCTAG GGCTCGGCAT GAGCATCAGC TACGCGGCCC TGCTGACCAA GACCAACCGC 

1800 

ATTTACCGCA TCTTTGAGCA GGGCAAACGG TCGGTGAGTG GCCCGCGTTT CATCAGCCCG 

1860 

GCCTCGCAGC TGGCCATCAC CTTCATCCTC ATCTCCCTGC AGCTGCTCGG CATCTGCGTG 

1920 

TGGTTCGTGG TGGACCCCTC CCACTCGGTG GTGGACTTCC AGGACCAACG GACACTTGAC 

1980 

CCCCGCTTTG CCAGGGGCGT GCTCAAGTGC GACATCTCGG ACCTGTCCCT CATCTGCCTG 

2040 

CTGGGCTACA GCATGCTGCT GATGGTCACG TGTACTGTGT ACGCGATCAA GACCCGAGGC 

2100 

GTGCCCGAGA CCTTCAACGA GGCCAAGCCC ATCGGCTTCA CCATGTACAC CACCTGCATT 

2160 

GTCTGGCTGG CCTTCATCCC CATCTTTTTT GGCACCTCAC AGTCAGCCGA CAAGCTGTAC 

2220 

ATCCAGACAA CCACACTGAC GGTCTCCGTG AGTCTGAGCG CTTCAGTGTC CCTGGGGATG 

2280 

CTCTACATGC CCAAAGTCTA CATCATCCTC TTCCATATTT TTCCATTGTG CTCCTGGCCT 



FIG. 9B
2340
TCCCCTGCCA TCTGCCCTGC CCCCTGCCCC TCCTCCCTGA GCTGCCCCAT CCCCGCCATC

2400

ATTTTCTCTT CTGTTCCCCC TCGATCTCAT TTCCTACCAG CCTTCCCCGT ACTTGGCTTC

2426
CTCCACCAAC TCTTTCACCA CGTJGC



FIG. 9C